(57) Abstract: PDE10A, a gene that is normally highly expressed in mammalian striatum and elsewhere, has been found to decrease
in expression during the development of CAG repeat disorders such as Huntington's disease. The invention teaches a method for
detecting the presence of or the predisposition for a CAG repeat disorder. Compounds which modulate CAG repeat disorders and
their uses are taught. Methods for screening for further compounds to modulate CAG repeat disorders are also taught.

Gene Necessary for Striatal Function, Uses Thereof, and
Compounds for Modulating Same

CROSS-REFERENCE 

This patent claims priority from Canadian Patent application no. 2,285,690 filed October 7, 
1999, US provisional application no. 60/158,043 filed October 7, 1999, and US provisional 
application no. 60/217,765 filed July 12, 2000, entitled Gene Necessary for Striatal Function, 
Uses Thereof, and Compounds for Modulating Same. 

FIELD OF THE INVENTION 

The present invention relates to a polynucleotide, PDE10A, which is down-regulated during
the development of CAG repeat disorders, such as Huntington's disease. The present
invention also describes compounds that modulate CAG repeat disorders, processes for
expressing PDE10A, and its agonists and antagonists, and uses of PDE10A, and its variants,
derivatives, agonists and antagonists.

BACKGROUND OF THE INVENTION 

Very few if any effective treatments exist for neurological disorders characterized by 
progressive cell loss, known as neurodegenerative diseases, as well as those involving acute 
cell loss, such as stroke and trauma. 

Huntington's disease (HD) is an inherited neurological disorder that is transmitted in an
autosomal dominant fashion. HD results from genetically programmed degeneration of
neurons in certain areas of the brain. Huntington's disease is caused by a mutation of the
gene IT15 that codes for the protein huntingtin. The huntingtin gene contains a polymorphic
stretch of repeated CAG trinucleotides that encode a polyglutamine tract within huntingtin. If
this tract exceeds 35 repeats, Huntington's disease results. Huntington's disease is only
one of a number of neurological diseases which are characterised by these polyglutamine
repeats (Ross, 1997). Schizophrenia, Alzheimer's disease, stroke, trauma, and Parkinson's
disease also affect the basal ganglia.

Huntingtin has no sequence similarity to known proteins (Group THDCR, 1993; Sisodia, 
1998). The function of the normal or mutated HD form of huntingtin has not been defined by 
the prior art. It is evident, however, that the expression of the HD form of huntingtin leads to
progressive and selective neuronal loss. It has been demonstrated that the GABA- and 
enkephalin-containing medium spiny projection neurons of the caudate-putamen eventually 
die as a result of HD (Richfield et al., 1994). Patients with minimal cell loss, however, still 
present with motor and cognitive symptoms, suggesting that neuronal dysfunction, and not
simply cell loss, contributes to the symptoms of HD. The motor symptoms of HD include the
development of chorea, dystonia, bradykinesia and tremors (Young et al., 1986). Voluntary 
movements may also be affected such that there may be disturbances in speech (Ludlow et 
al., 1987) and degradation of fine motor co-ordination (Young et al., 1986). In addition to 
motor decline, emotional disturbances and cognitive loss are also evident during the 
progression of HD (Caine et al., 1978). 

Despite the fact that huntingtin is ubiquitously expressed, HD specifically affects cells of the 
basal ganglia, structures deep within the brain that have a number of important functions,
including co-ordinating movement. The basal ganglia include the caudate nucleus, the
putamen, the nucleus accumbens and the olfactory tubercle. HD also affects the brain's
outer surface, or cortex, which controls thought, perception, and memory. The mechanism by
which only a small group of neurons in the striatum and cortex are rendered vulnerable to this
ubiquitously expressed mutant protein is not known. There are no effective treatments for
Huntington's disease. 

Huntington's disease is widely believed to be a gain-of-function disorder, but neither the
normal function nor the gained function of huntingtin is known. Because the function of
huntingtin is not known, there is little insight into the disease process. It was believed that
huntingtin was related to neuronal intranuclear inclusions (NII). However, recent results have
cast doubt on our understanding of the role of the NII in Huntington's disease (Saudou et al.,
1998) or in other CAG repeat disorders (Klement et al., 1998; see also commentary by
Sisodia, 1998). 

The development of a mouse carrying the 5' end of the human Huntington's disease gene (the
promoter and first exon; Mangiarini et al., 1996) was an important step in the development
of the tools that will allow us to understand the function (and gain-of-function) associated
with huntingtin. R6/2 mice exhibit a rapidly progressing neurological phenotype with onset
at about 8 weeks. This phenotype includes a movement disorder characterised by shuddering,
resting tremor, epileptic seizures and stereotyped behaviour. These symptoms suggest that
the function of the basal ganglia is affected by the expression of the human exon 1 transgene
prior to neuronal cell death. By 12 weeks the affected mice have significantly reduced brain
weights and they die by about 13 weeks of age. Neuronal intranuclear inclusions (NII)
develop at about 4 weeks (Davies et al., 1997). As is observed in human Huntington's
disease patients, the R6/2 mice show changes in neuronal receptors (Cha et al., 1998). The
present inventors have also demonstrated that the expression of DARPP-32 and
cannabinoid receptors changes over time in HD mice; such changes have also been observed
in human Huntington's disease patients (unpublished results). The loss of the cannabinoid
receptor is one of the earliest documented changes that occur prior to neuronal degeneration
in human HD patients. The R6/2 model, therefore, mimics the early phases of HD, a point in
disease development where intervention would be most appropriate. 

Human PDE10 was recently identified from cDNA fragments published on the
National Center for Biotechnology Information (NCBI) Expressed Sequence Tags (EST)
database (Loughney et al., WO99/42596). While PDE10 was found to share homology with
known PDEs, no function could be identified for PDE10.

SUMMARY OF THE INVENTION 

The present invention provides the function and uses of a nucleotide segment, PDE10A, and
compounds which inhibit or promote the development of CAG repeat disorders such as
Huntington's Disease.

The invention teaches a method for identifying a compound which inhibits or promotes a 
CAG repeat disorder, comprising the steps of: (a) selecting a control animal having PDE10A
and a test animal having PDE10A; (b) treating said test animal using a compound; and (c)
determining the relative quantity of RNA corresponding to PDE10A, as between said
animals. In an embodiment, the animal is a mammal, preferably a mouse, and preferably a
transgenic mouse. In an embodiment, the CAG repeat disorder is Huntington's disease. 

The invention also teaches a method for identifying a compound which inhibits or promotes a 
CAG repeat disorder, comprising the steps of: (a) selecting a host cell containing PDE10A;
(b) cloning said host cell and separating said clones into a test group and a control group; (c)
treating said test group using a compound; and (d) determining the relative quantity of RNA
corresponding to PDE10A, as between said test group and said control group. In an
embodiment, the CAG repeat disorder is Huntington's disease. 

The invention further teaches a method for detecting the presence of or the predisposition for 
a CAG repeat disorder, said method comprising determining the level of expression of RNA 
corresponding to PDE10A in an individual relative to a predetermined control level of
expression, wherein a decreased expression of said RNA as compared to said control is 
indicative of a CAG repeat disorder. Preferably, the expression is measured by in situ 
hybridization, fluorescent in situ hybridization, polymerase chain reaction, or DNA 
fingerprinting technique. In an embodiment, the CAG repeat disorder is Huntington's 
disease. 
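
The comparison in the detection method above can be sketched as a simple ratio test. This Python sketch is illustrative only: the function names, the arbitrary expression units, and the 0.5 cutoff are assumptions made for the example and are not values taught in this specification.

```python
def relative_expression(sample_level: float, control_level: float) -> float:
    """Return the sample's PDE10A RNA level as a fraction of the control level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return sample_level / control_level


def is_decreased(sample_level: float, control_level: float,
                 cutoff: float = 0.5) -> bool:
    """Flag a sample whose expression falls below a fraction of the control.

    The cutoff is a hypothetical threshold chosen for illustration; a real
    assay would need an empirically validated control level and threshold.
    """
    return relative_expression(sample_level, control_level) < cutoff
```

For instance, a densitometry reading of 20 against a control of 100 would be flagged as decreased under this assumed cutoff, while a reading of 90 would not.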

The invention further teaches compositions for treating a CAG repeat disorder comprising a
compound which modulates PDE10 expression and a pharmaceutically acceptable carrier.
The compound can be selected from the group consisting of: quinpirole, alloxan, miconazole
nitrate, MDL-12330A and tetracycline derivatives such as demeclocycline. The compound
may be selected from the group consisting of:
(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,
(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,
(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,
(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-3-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione, and
(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2,3-dimethyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,
or from the group consisting of: KS-505, IC224, SCH 51866, IBMX and Dipyridamole. The disorder can be HD.

The invention also teaches the use of a composition which modulates PDE10 for treating a
CAG repeat disorder comprising administering the composition to a subject in need of such
treatment, and such use of the composition which modulates PDE10 for treating HD.

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a portion of an autoradiogram of the differential display reaction identifying
PDE10A in mouse brain mRNA.

FIG. 2 is a northern blot confirming that PDE10A has a lower steady-state level of expression
in the striatum of transgenic HD mice.

FIG. 3 is a nucleotide sequence of the differential display cDNA fragment of pPDE10A.

FIG. 4 shows the in situ hybridization of probe 1 to coronal and sagittal brain sections of
10-week-old wild-type and HD mice.

FIG. 5 shows the in situ hybridization corresponding to spatial and temporal expression of
PDE10A in brain sections of wild-type and HD mice over the period of time that the HD
mice develop abnormal movements and postures.

FIG. 6 shows the in situ hybridization corresponding to expression of PDE10A in brain
sections of one-day-old wild-type and HD mice.

FIG. 7 shows the in situ hybridization corresponding to distribution of the mRNA of
PDE10A in mouse striatal neurons.

FIG. 8 is the in situ hybridization corresponding to mRNA distribution of the rat homologue
of PDE10A in rat brain tissue.

FIG. 9 shows a Southern blot analysis of DNA from wild-type and transgenic HD mice
hybridized to the pPDE10A cDNA probe.

FIG. 10 is a nucleotide sequence of cPDE10-1, and corresponds to SEQ ID NO. 1.

FIG. 11 is a restriction map of cPDE10-1.

FIG. 12 is a nucleotide sequence of cPDE10-2, and corresponds to SEQ ID NO. 2.

FIG. 14 is a schematic diagram showing the alignment of cPDE10-1 and -2 and the regions
that are identical and unique between the two clones.

FIG. 15 is a nucleotide sequence of cPDE10A and RACEs, corresponding to SEQ ID NO. 11.

FIG. 16 is a map of PDE10A coding sequence and restriction sites.

FIG. 17 is a map of PDE10A coding sequence and features.

FIG. 18 is a restriction map of PDE10A.

FIG. 19 is a nucleotide sequence of cPDE10A and corresponds to SEQ ID NO. 12.

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

The following illustrative explanations are provided to facilitate understanding of certain
terms used frequently herein. The explanations are provided as a convenience and are not 
limitative of the invention. 

"Host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous polynucleotide sequence. 

"Identity", "similarity" or "homologous", as used in the art, are relationships between two or 
more polynucleotide sequences, as determined by comparing the sequences. In the art,
identity also means the degree of sequence relatedness between polynucleotide sequences, as
the case may be, as determined by the match between strings of such sequences. Both
identity and similarity can be readily calculated (Lesk, A. M., 1988; Smith, D. W., 1993;
Griffin, A. M., and Griffin, H. G., 1994; von Heinje, G., 1987; and Gribskov, M. and
Devereux, J., 1991). While there exist a number of methods to measure identity and
similarity between two polynucleotide sequences, both terms are well known to skilled
artisans (von Heinje, G., 1987; Gribskov, M. and Devereux, J., 1991; and Carillo, H., and
Lipman, D., 1988). Methods commonly employed to determine identity or similarity
between sequences include, but are not limited to, those disclosed in Carillo, H., and Lipman,
D. (1988). Methods to determine identity and similarity are codified in computer programs.
Computer program methods to determine identity and similarity between two sequences
include, but are not limited to, GCG program package (Devereux, J., et al., 1984), BLASTP,
BLASTN, and FASTA (Atschul, S. F. et al., 1990).
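
To make the notion of identity concrete, the following Python sketch computes percent identity over two sequences that have already been aligned. It is a minimal illustration under stated assumptions, not the method of GCG, BLAST or FASTA: the function name `percent_identity`, the use of `-` as the gap character, and the choice to count gapped columns in the denominator are all assumptions, and real programs differ in how they define the denominator.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap ('-') are ignored; a column with
    a gap in only one sequence counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared
```

Under this convention, two four-base sequences differing at one position are 75% identical.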

"Isolated" means altered "by the hand of man" from its natural state; i.e. , that, if it occurs in 
nature, it has been changed or removed from its original environment, or both. For example, 
a naturally occurring polynucleotide naturally present in a living organism in its natural state 
is not "isolated," but the same polynucleotide separated from coexisting materials of its 
natural state is "isolated", as the term is employed herein. As part of or following isolation, 
such polynucleotides can be joined to other polynucleotides, such as DNA, for mutagenesis, 
to form fusion proteins, and for propagation or expression in a host, for instance. The 
isolated polynucleotides, alone or joined to other polynucleotides such as vectors, can be 
introduced into host cells, in culture or in whole organisms. Introduced into host cells in 
culture or in whole organisms, such DNA still would be isolated, as the term is used herein,
because they would not be in their naturally occurring form or environment. Similarly, the
polynucleotides may occur in a composition, such as media formulations, solutions for
introduction of polynucleotides, for example, into cells, compositions or solutions for
chemical or enzymatic reactions, for instance, which are not naturally occurring
compositions, and, therein remain isolated polynucleotides within the meaning of that term as
it is employed herein.

"Plasmids". Starting plasmids disclosed herein are either commercially available, publicly
available, or can be constructed from available plasmids by routine application of well 
known, published procedures. Many plasmids and other cloning and expression vectors that 
can be used in accordance with the present invention are well known and readily available to 
those of skill in the art. Moreover, those of skill readily may construct any number of other 
plasmids suitable for use in the invention. 

"Polynucleotides(s)" of the present invention may be in the form of RNA, such as mRNA, or 
in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or
produced by chemical synthetic techniques or by a combination thereof. The DNA may be
double-stranded or single-stranded. Single-stranded polynucleotides may be the coding
strand, also known as the sense strand, or the non-coding strand, also referred to as
the anti-sense strand. Polynucleotides generally refers to any polyribonucleotide or
polydeoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA.
Thus, for instance, polynucleotides as used herein refers to, among others, single- and
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions or single-,
double- and triple-stranded regions, single- and double-stranded RNA, and RNA that is a
mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA
that may be single-stranded or, more typically, double-stranded, or triple-stranded, or a
mixture of single- and double-stranded regions. In addition, polynucleotide as used herein
refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The
strands in such regions may be from the same molecule or from different molecules. The
regions may include all of one or more of the molecules, but more typically involve only a
region of some of the molecules. One of the molecules of a triple-helical region often is an
oligonucleotide. As used herein, the term polynucleotide also includes DNA or RNA that
contains one or more modified bases. Thus, DNA or RNA with backbones modified for
stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover,
DNA or RNA comprising unusual bases, such as inosine, or modified bases, such as
tritylated bases, to name just two examples, are polynucleotides as the term is used herein. It
will be appreciated that a great variety of modifications have been made to DNA and RNA
that serve many useful purposes known to those of skill in the art. The term polynucleotide as
it is employed herein embraces such chemically, enzymatically or metabolically modified
forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of
viruses and cells, including simple and complex cells, inter alia. Polynucleotides embraces
short polynucleotides often referred to as oligonucleotide(s). It will also be appreciated that
RNA made by transcription of this double-stranded nucleotide sequence, and an antisense
strand of a nucleic acid molecule of the invention or an oligonucleotide fragment of the
nucleic acid molecule, are contemplated within the scope of the invention. An antisense
sequence is constructed by inverting the sequence of a nucleic acid molecule of the invention,
relative to its normal presentation for transcription. Preferably, an antisense sequence is
constructed by inverting a region preceding the initiation codon or an unconserved region.
The antisense sequences may be constructed using chemical synthesis and enzymatic ligation
reactions using procedures known in the art.
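
One common way to derive an antisense sequence in silico is to take the reverse complement of the sense strand. The Python sketch below illustrates that reading of "inverting the sequence"; the function name `reverse_complement` and the restriction to the four unmodified DNA bases are assumptions made for the example.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sense: str) -> str:
    """Return the antisense strand, written 5' to 3', for a DNA sense strand."""
    return sense.translate(_COMPLEMENT)[::-1]
```

Applied to a CAG repeat, for example, the antisense strand reads as a CTG repeat.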

"Stringent hybridization conditions" are those which are stringent enough to provide
specificity, reduce the number of mismatches and yet are sufficiently flexible to allow
formation of stable hybrids at an acceptable rate. Such conditions are known to those skilled
in the art and are described, for example, in Sambrook et al. (1989). By way of example
only, stringent hybridization with short nucleotides may be carried out at 5-10° below the
Tm using high concentrations of probe such as 0.01-1.0 pmole/mL. Preferably, the term "stringent
conditions" means hybridization will occur only if there is at least 95% and preferably at least
97% identity between the sequences.
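
As a rough illustration of choosing a hybridization temperature for a short probe, the Python sketch below estimates the melting temperature with the Wallace rule (2 °C per A or T, 4 °C per G or C) and returns a window 5 to 10 degrees below it. The Wallace rule is a coarse approximation for short oligonucleotides and is not taken from this specification; the function names are hypothetical.

```python
def wallace_tm(oligo: str) -> float:
    """Approximate melting temperature (deg C) of a short DNA oligonucleotide."""
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    if not s or at + gc != len(s):
        raise ValueError("oligo must be a non-empty string of A, C, G, T")
    return 2.0 * at + 4.0 * gc


def hybridization_window(oligo: str) -> tuple[float, float]:
    """Return (low, high) temperatures lying 10 and 5 degrees below the estimate."""
    tm = wallace_tm(oligo)
    return (tm - 10.0, tm - 5.0)
```

For a 16-mer with eight A/T and eight G/C bases, the estimate is 48 °C, giving an illustrative window of 38 to 43 °C.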

"Variant(s)" of polynucleotides are polynucleotides that differ in nucleotide sequence from 
another, reference polynucleotide. Generally, differences are limited so that the nucleotide 
sequences of the reference and the variant are closely similar overall and, in many regions, 
identical. Changes in the nucleotide sequence of the variant may be silent. That is, they may 
not alter the amino acids encoded by the polynucleotide. Where alterations are limited to 
silent changes of this type a variant will encode a polypeptide or polynucleotide with the 
same amino acid sequence as the reference. Changes in the nucleotide sequence of the
variant may alter the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Such nucleotide changes may result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide or polynucleotide encoded by the 
reference sequence. 

As hereinbefore mentioned, the present inventors have identified and sequenced a DNA
sequence encoding PDE10A. The DNA sequence is shown in the Sequence Listing as SEQ
ID NO:1, NO:2 and NO:11.

It will be appreciated that the invention includes nucleotide or amino acid sequences which 
have substantial sequence homology with the nucleotide sequences shown in the Sequence 
Listing as SEQ ID NO:1, NO:2 and NO:11. The term "sequences having substantial
sequence homology" means those nucleotide and amino acid sequences which have slight or
inconsequential sequence variations from the sequences disclosed in the Sequence Listing as
SEQ ID NO:1, NO:2 and NO:11; i.e. the homologous sequences function in substantially
the same manner to produce substantially the same polypeptides as the actual sequences. The
variations may be attributable to local mutations or structural modifications. It is expected
that a sequence having 85-90% sequence homology with the DNA sequence of the invention
will provide a functional PDE10 polypeptide.

As used herein, "PDE10A" comprises a polynucleotide sequence which is down-regulated in
the course of CAG repeat disorders, selected from the group consisting of: (a) a sequence
comprising SEQ ID NO:1; (b) a sequence comprising SEQ ID NO:2; (c) a sequence
comprising SEQ ID NO:11; (d) a sequence comprising nucleotides 257 to 2596 of SEQ ID
NO:11; (e) a sequence which is at least 90% homologous with a sequence of (a), (b), (c) or
(d); and (f) a sequence which hybridizes to (a), (b), (c) or (d) under stringent conditions. In
an embodiment, the isolated polynucleotide segment is cDNA. The invention also teaches an
isolated polynucleotide segment, which retains substantially the same biological function or
activity as the polynucleotide encoded by the polynucleotide sequence.

Further embodiments of the invention are polynucleotides that are at least 70% identical over
their entire length to a polynucleotide encoding PDE10 polypeptide or polynucleotide, and
polynucleotides which are complementary to such polynucleotides. Other embodiments are
polynucleotides that comprise a region that is at least 80% identical over their entire length to
a polynucleotide encoding PDE10 of SEQ ID NO. 11 and polynucleotides complementary
thereto. This includes polynucleotides at least 90% identical over their entire length to the
same, and among these embodiments are polynucleotides with at least 95% identity. Furthermore,
those with at least 97% are highly preferred among those with at least 95%, and among these
those with at least 98% and at least 99% are particularly highly preferred, with at least 99%
being the more preferred.

The polynucleotides of the present invention may be employed as research reagents and 
materials for discovery of treatments of and diagnostics for disease, particularly human 
disease, as further discussed herein. 

Analysis of the complete nucleotide and amino acid sequences of the protein of the invention
using the procedures of Sambrook et al., supra, has been used to determine the expressed
region, initiation codon and untranslated sequences of the PDE10A gene. The transcription
regulatory sequences of the gene are determined by analyzing fragments of the DNA for their 
ability to express a reporter gene such as the bacterial gene lacZ. 

The nucleic acid molecules of the invention allow those skilled in the art to construct 
nucleotide probes for use in the detection of nucleotide sequences in biological materials. As
shown in FIGS. 11, 13, 15 and 16, a number of unique restriction sequences for restriction
enzymes are incorporated in the nucleic acid molecule identified in the Sequence Listing as
SEQ ID NO:1, NO:2 and NO:11, and these provide access to nucleotide sequences which
code for polypeptides unique to the PDE10A polypeptide of the invention. Nucleotide
sequences unique to PDE10A or isoforms thereof can also be constructed by chemical
synthesis and enzymatic ligation reactions carried out by procedures known in the art.

A nucleotide probe may be labeled with a detectable marker such as a radioactive label which 
provides for an adequate signal and has sufficient half-life such as 32P, 3H, 14C or the like.
Other detectable markers which may be used include antigens that are recognized by a 
specific labeled antibody, fluorescent compounds, enzymes, antibodies specific for a labeled 
antigen, and chemiluminescent compounds. An appropriate label may be selected having 
regard to the rate of hybridization and binding of the probe to the nucleotide to be detected 
and the amount of nucleotide available for hybridization. The nucleotide probes may be used 
to detect genes related to or analogous to PDE10A of the invention.

Accordingly, the present invention also provides a method of detecting the presence of 
nucleic acid molecules encoding a polypeptide related to or analogous to PDE10A in a
sample comprising contacting the sample under hybridization conditions with one or more of 
the nucleotide probes of the invention labeled with a detectable marker, and determining the 
degree of hybridization between the nucleic acid molecule in the sample and the nucleotide 
probes. 



WO 01/24781 PCT/CA00/01188

Hybridization conditions which may be used in the method of the invention are known in the 
art and are described for example in Sambrook J, et al., supra. The hybridization product 
may be assayed using techniques known in the art. The nucleotide probe may be labeled with 
a detectable marker as described herein and the hybridization product may be assayed by 
detecting the detectable marker or the detectable change produced by the detectable marker. 

The nucleic acid molecule of the invention also permits the identification and isolation, or 
synthesis of nucleotide sequences which may be used as primers to amplify a polynucleotide 
molecule of the invention, for example in polymerase chain reaction (PCR). The length and 
bases of the primers for use in the PCR are selected so that they will hybridize to different 
strands of the desired sequence and at relative positions along the sequence such that an 
extension product synthesized from one primer when it is separated from its template can 
serve as a template for extension of the other primer into a nucleic acid of defined length. 
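
The primer geometry described above, in which two primers bind opposite strands so that each extension product can template the other, can be sketched as a simple in-silico amplification. This Python example is a minimal illustration assuming exact matches only; the function names and the toy template are hypothetical and are not sequences from this specification.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon(template: str, forward: str, reverse: str):
    """Return the product bounded by the two primers, or None if either fails to bind.

    `template` is the top strand (5' to 3'). The forward primer must match the
    top strand; the reverse primer must match the bottom strand, i.e. its
    reverse complement must occur on the top strand downstream of the forward site.
    """
    top = template.upper()
    start = top.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse.upper())
    end = top.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return top[start:end + len(reverse_site)]
```

The length of the returned string corresponds to the "defined length" of the amplified nucleic acid; a real design tool would also tolerate mismatches and check for multiple binding sites.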

Primers which may be used in the invention are oligonucleotides i.e. molecules containing 
two or more deoxyribonucleotides of the nucleic acid molecule of the invention which occur 
naturally as in a purified restriction endonuclease digest or are produced synthetically using 
techniques known in the art such as, for example, phosphotriester and phosphodiester
methods (see Good et al., 1977) or automated techniques (see, for example, Connolly, B. A.,
1987). The primers are capable of acting as a point of initiation of synthesis when placed
under conditions which permit the synthesis of a primer extension product which is
complementary to the DNA sequence of the invention, e.g. in the presence of nucleotide
substrates, an agent for polymerization such as DNA polymerase and at suitable temperature
and pH. Preferably, the primers are sequences that do not form secondary structures by base
pairing with other copies of the primer or sequences that form a hairpin configuration. The
primer may be single- or double-stranded. When the primer is double-stranded it may be
treated to separate its strands before using it to prepare amplification products. The primer
preferably contains between about 7 and 25 nucleotides.
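
The primer preferences stated above (about 7 to 25 nucleotides, no pairing with other copies of the primer, no hairpins) can be screened with a coarse check such as the Python sketch below. The 4-base stem length and the function names are assumptions chosen for illustration; dedicated primer-design software applies thermodynamic models instead of this simple substring test.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_self_complementarity(primer: str, stem: int = 4) -> bool:
    """True if any `stem`-long stretch could base-pair with the primer itself.

    A stretch whose reverse complement also occurs in the primer could form a
    hairpin or pair with another copy of the primer (assumed stem length: 4).
    """
    s = primer.upper()
    for i in range(len(s) - stem + 1):
        if reverse_complement(s[i:i + stem]) in s:
            return True
    return False


def primer_ok(primer: str, min_len: int = 7, max_len: int = 25,
              stem: int = 4) -> bool:
    """Apply the length range from the text plus the coarse pairing check."""
    return (min_len <= len(primer) <= max_len
            and not has_self_complementarity(primer, stem))
```

A 12-mer such as `GGGGAAAACCCC` fails this screen because its two ends can pair, whereas a 12-mer of alternating A and C passes.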

The primers may be labeled with detectable markers which allow for detection of the 
amplified products. Suitable detectable markers are radioactive markers such as P-32, S-35,
I-125, and H-3, luminescent markers such as chemiluminescent markers, preferably luminol,
and fluorescent markers, preferably dansyl chloride, fluorescein-5-isothiocyanate, and
4-fluoro-7-nitrobenz-2-oxa-1,3-diazole, enzyme markers such as horseradish peroxidase, alkaline
phosphatase, beta-galactosidase, acetylcholinesterase, or biotin.

It will be appreciated that the primers may contain non-complementary sequences provided 
that a sufficient amount of the primer contains a sequence which is complementary to a 
nucleic acid molecule of the invention or oligonucleotide sequence thereof, which is to be 
amplified. Restriction site linkers may also be incorporated into the primers allowing for
digestion of the amplified products with the appropriate restriction enzymes facilitating 
cloning and sequencing of the amplified product. 

Thus, a method of determining the presence of a nucleic acid molecule having a sequence 
encoding PDE10A or a predetermined oligonucleotide fragment thereof in a sample, is
provided comprising treating the sample with primers which are capable of amplifying the
nucleic acid molecule or the predetermined oligonucleotide fragment thereof in a polymerase
chain reaction to form amplified sequences, under conditions which permit the formation of 

The polymerase chain reaction refers to a process for amplifying a target nucleic acid
sequence as generally described in Innis et al., Academic Press, 1989, in Mullis et al., U.S.
Pat. No. 4,683,195 and Mullis, U.S. Pat. No. 4,683,202, which are incorporated herein by
reference. Conditions for amplifying a nucleic acid template are described in M. A. Innis and
D. H. Gelfand, 1989, which is also incorporated herein by reference.

The amplified products can be isolated and distinguished based on their respective sizes using 
techniques known in the art. For example, after amplification, the DNA sample can be 
separated on an agarose gel and visualized, after staining with ethidium bromide, under
ultraviolet (UV) light. DNA may be amplified to a desired level and a further extension reaction
may be performed to incorporate nucleotide derivatives having detectable markers such as 
radioactive labeled or biotin labeled nucleoside triphosphates. The primers may also be 
labeled with detectable markers. The detectable markers may be analyzed by restriction and 
electrophoretic separation or other techniques known in the art. 

The conditions which may be employed in the methods of the invention using PCR are those
which permit hybridization and amplification reactions to proceed in the presence of DNA in
a sample and appropriate complementary hybridization primers. Conditions suitable for the
polymerase chain reaction are generally known in the art. For example, see M. A. Innis and
D. H. Gelfand, 1989, which is incorporated herein by reference. Preferably, the PCR utilizes
polymerase obtained from the thermophilic bacterium Thermus aquaticus (Taq polymerase,
GeneAmp Kit, Perkin Elmer Cetus); other thermostable polymerases may also be used to amplify
DNA template strands.

It will be appreciated that other techniques such as the Ligase Chain Reaction (LCR) and 
Nucleic-Acid Sequence Based Amplification (NASBA) may be used to amplify a nucleic 
acid molecule of the invention. In LCR, two primers which hybridize adjacent to each other 
on the target strand are ligated in the presence of the target strand to produce a 
complementary strand (Barney, 1991 and European Published Application No. 0320308, 
published Jun. 14, 1989). NASBA is a continuous amplification method using two primers, 
one incorporating a promoter sequence recognized by an RNA polymerase and the second 
derived from the complementary sequence of the target sequence to the first primer (U.S. Ser. 
No. 5,130,238 to Malek). 

The present invention also teaches vectors which comprise a polynucleotide or 
polynucleotides of the present invention, host cells which are genetically engineered with 
vectors of the invention and the production of polynucleotides of the invention by 
recombinant techniques. 

In accordance with this aspect of the invention the vector may be, for example, a plasmid 
vector, a single or double-stranded phage vector, a single or double-stranded RNA or DNA 
viral vector. In certain embodiments in this regard, the vectors provide for specific 
expression. Such specific expression may be inducible expression or expression only in 
certain types of cells or both inducible and cell-specific. Particular among inducible vectors 
are vectors that can be induced for expression by environmental factors that are easy to 
manipulate, such as temperature and nutrient additives. A variety of vectors suitable to this 
aspect of the invention, including constitutive and inducible expression vectors for use in 
prokaryotic and eukaryotic hosts, are well known and employed routinely by those of skill in 
the art. Such vectors include, among others, chromosomal, episomal and virus-derived 
vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from transposons, 
from yeast episomes, from insertion elements, from yeast chromosomal elements, from 
viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, 
fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 
combinations thereof, such as those derived from plasmid and bacteriophage genetic 
elements, such as cosmids and phagemids, all may be used for expression in accordance with 
this aspect of the present invention. 

The following vectors, which are commercially available, are provided by way of example. 
Among vectors for use in bacteria are pQE70, pQE60 and pQE-9, available from Qiagen; 
pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, 
available from Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available 
from Pharmacia, and pBR322 (ATCC 37017). Among eukaryotic vectors are pWLNEO, 
pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV, pMSG 
and pSVL available from Pharmacia. These vectors are listed solely by way of illustration of 
the many commercially available and well known vectors that are available to those of skill in 
the art for use in accordance with this aspect of the present invention. It will be appreciated 
that any other plasmid or vector suitable for, for example, introduction, maintenance, 
propagation or expression of a polynucleotide or polypeptide of the invention in a host may 
be used in this aspect of the invention. Generally, any vector suitable to maintain, propagate 
or express polynucleotides to express a polypeptide or polynucleotide in a host may be used 

The appropriate DNA sequence may be inserted into the vector by any of a variety of 
well-known and routine techniques. In general, expression constructs will contain sites for 
transcription initiation and termination, and, in the transcribed region, a ribosome binding site 
for translation. The coding portion of the mature transcripts expressed by the constructs will 
include a translation initiating AUG at the beginning and a termination codon appropriately 
positioned at the end of the polynucleotide to be translated. 
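
As a non-limiting sketch, the requirements stated above for the coding portion (an initiating AUG at the beginning and an appropriately positioned termination codon at the end) can be checked programmatically. The function name and the example sequences are hypothetical, and the sequence is assumed to be supplied as DNA, so that AUG appears as ATG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_coding_region(cds: str) -> list:
    """Return a list of problems found in a DNA coding sequence (empty if none)."""
    cds = cds.upper()
    problems = []
    if len(cds) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not cds.startswith("ATG"):
        problems.append("no initiating ATG (AUG in the transcript)")
    # Split into complete in-frame codons, ignoring any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no termination codon at the end")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("premature in-frame termination codon")
    return problems


print(check_coding_region("ATGGCTTGGTAA"))   # well-formed toy sequence
print(check_coding_region("GCTTGGTAA"))      # lacks the initiating ATG
```

The check covers only the coding portion; transcription initiation and termination sites and the ribosome binding site would need to be verified separately against the particular vector used.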

The DNA sequence in the expression vector is operatively linked to appropriate expression 
control sequence(s), including, for instance, a promoter to direct mRNA transcription. 
Promoter regions can be selected from any desired gene using vectors that contain a reporter 
transcription unit lacking a promoter region, such as a chloramphenicol acetyl transferase 
("CAT") transcription unit, downstream of restriction site or sites for introducing a candidate 
promoter fragment; i.e., a fragment that may contain a promoter. As is well known, 
introduction into the vector of a promoter-containing fragment at the restriction site upstream 
of the cat gene engenders production of CAT activity, which can be detected by standard 
CAT assays. Vectors suitable to this end are well known and readily available, such as 
pKK232-8 and pCM7. Promoters for expression of polynucleotides of the present invention 
include not only well known and readily available promoters, but also promoters that readily 
may be obtained by the foregoing technique, using a reporter gene. Among known 
prokaryotic promoters suitable for expression of polynucleotides and polypeptides in 
accordance with the present invention are the E. coli lacI and lacZ promoters, the T3 and 
T7 promoters, the gpt promoter, the lambda PR, PL promoters and the trp promoter. Among 
known eukaryotic promoters suitable in this regard are the CMV immediate early promoter, 
the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of 
retroviral LTRs, such as those of the Rous sarcoma virus ("RSV"), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. 

Vectors for propagation and expression generally will include selectable markers and 
amplification regions, such as, for example, those set forth in Sambrook et al., supra. 

As hereinbefore mentioned, the present invention also teaches host cells which are genetically 
engineered with vectors of the invention. 

Polynucleotide constructs in host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. The PDE10A polynucleotide or 
polypeptide products or isoforms or parts thereof, may be obtained by expression in a suitable 
host cell using techniques known in the art. Suitable host cells include prokaryotic or 
eukaryotic organisms or cell lines, for example bacterial, mammalian, yeast, or other fungi, 
viral, plant or insect cells. Methods for transforming or transfecting cells to express foreign 
DNA are well known in the art (See for example, Itakura et al., U.S. Pat. No. 4,704,362; 
Hinnen et al., 1978; Murray et al., U.S. Pat. No. 4,801,542; Upshall et al., U.S. Pat. No. 
4,935,349; Hagen et al., U.S. Pat. No. 4,784,950; Axel et al., U.S. Pat. No. 4,399,216; 
Goeddel et al., U.S. Pat. No. 4,766,075; and Sambrook et al., 1989, all of which are 
incorporated herein by reference). Representative examples of appropriate hosts include 
bacterial cells, such as streptococci, staphylococci, E. coli, streptomyces and Bacillus subtilis 
cells; fungal cells, such as yeast cells and Aspergillus cells; insect cells such as Drosophila S2 
and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and 
Bowes melanoma cells; and plant cells. 



Host cells can be genetically engineered to incorporate polynucleotides and express 
polynucleotides of the present invention. Introduction of polynucleotides into the host cell 
can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, 
transduction, scrape loading, ballistic introduction, infection or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al. (1986) and Sambrook 
et al. (1989). 

As hereinbefore mentioned, the present invention also teaches the production of 
polynucleotides of the invention by recombinant techniques. 

The PDE10 polynucleotides encode a polypeptide which is the mature protein plus additional 
amino or carboxyl-terminal amino acids, or amino acids interior to the mature polypeptide 
(when the mature form has more than one polypeptide chain, for instance). Such sequences 
may play a role in processing of a protein from precursor to a mature form, may allow protein 
transport, may lengthen or shorten protein half-life or may facilitate manipulation of a protein 
for assay or production, among other things. As generally is the case in vivo, the additional 
amino acids may be processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 



In sum, a polynucleotide of the present invention may encode a mature protein, a mature 
protein plus a leader sequence (which may be referred to as a preprotein), a precursor of a 
mature protein having one or more prosequences which are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader sequence 
and one or more prosequences, which generally are removed during processing steps that 
produce active and mature forms of the polypeptide. 

The polypeptides of the invention may be prepared by culturing the host/vector systems 
described above, in order to express the recombinant polypeptides. Recombinantly produced 
PDE10A based protein or parts thereof, may be further purified using techniques known in 
the art such as commercially available protein concentration systems, by salting out the 
protein followed by dialysis, by affinity chromatography, or using anion or cation exchange 
resins. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using DNA derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook et al., supra. 

Polynucleotides of the invention, encoding the heterologous structural sequence of a 
polynucleotide or polypeptide of the invention generally will be inserted into a vector using 
standard techniques so that it is operably linked to the promoter for expression. The 
polynucleotide will be positioned so that the transcription start site is located appropriately 5' 
to a ribosome binding site. The ribosome binding site will be 5' to the AUG that initiates 
translation of the polynucleotide or polypeptide to be expressed. Generally, there will be no 
other open reading frames that begin with an initiation codon, usually AUG, and lie between 
the ribosome binding site and the initiation codon. Also, generally, there will be a translation 
stop codon at the end of the expressed polynucleotide and there will be a polyadenylation 
signal in constructs for use in eukaryotic hosts. A transcription termination signal appropriately 
disposed at the 3' end of the transcribed region may also be included in the polynucleotide 
construct. 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
periplasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the expressed polynucleotide or polypeptide. These signals may be 
endogenous to the polynucleotide or they may be heterologous signals. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents; such 
methods are well known to those skilled in the art. PDE10A polynucleotide or polypeptide 
can be recovered and purified from recombinant cell cultures by well-known methods 
including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography is employed for 
purification. Well known techniques for refolding protein may be employed to regenerate 
active conformation when the polynucleotide is denatured during isolation and/or 
purification. 



In an embodiment, a nucleic acid molecule of the invention may be cloned into a glutathione 
S-transferase (GST) gene fusion system, for example the pGEX-1 T, pGEX-2T and pGEX-3X 
of Pharmacia. The fused gene may contain a strong lac promoter, inducible to a high level of 
expression by IPTG, as a regulatory element. Thrombin or factor Xa cleavage sites may be 
present which allow proteolytic cleavage of the desired polypeptide from the fusion product. 
The glutathione S-transferase-PDE10A fusion protein may be easily purified using a 
glutathione sepharose 4B column, for example from Pharmacia. The 26 kd glutathione 
S-transferase polypeptide can be cleaved by thrombin (pGEX-1 or pGEX-2T) or factor Xa 
(pGEX-3X) and resolved from the polypeptide using the same affinity column. 
Additional chromatographic steps can be included if necessary, for example Sephadex or 
DEAE cellulose. The two enzymes may be monitored by protein and enzymatic assays and 
purity may be confirmed using SDS-PAGE. 

The PDE10A protein or parts thereof may also be prepared by chemical synthesis using 
techniques well known in the chemistry of proteins such as solid phase synthesis (Merrifield, 
1964) or synthesis in homogenous solution (Houbenweyl, 1987). 

Within the context of the present invention, PDE10A polypeptide includes various structural 
forms of the primary protein which retain biological activity. For example, PDE10A 
polypeptide may be in the form of acidic or basic salts or in neutral form. In addition, 
individual amino acid residues may be modified by oxidation or reduction. Furthermore, 
various substitutions, deletions or additions may be made to the amino acid or nucleic acid 
sequences, the net effect being that biological activity of PDE10A is retained. Due to code 
degeneracy, for example, there may be considerable variation in nucleotide sequences 
encoding the same amino acid. 
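
The extent of the variation permitted by code degeneracy can be illustrated with a short Python sketch, which counts how many distinct nucleotide sequences encode a given peptide under the standard genetic code. The peptides used are arbitrary examples and are not fragments of PDE10A; a residue letter that is not in the table yields a count of zero.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons that encode each residue ("*" marks termination).
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_synonymous_sequences(peptide: str) -> int:
    """Number of distinct coding sequences for a peptide (one-letter code)."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


print(count_synonymous_sequences("MW"))    # Met and Trp each have a single codon
print(count_synonymous_sequences("MLS"))   # Leu and Ser each have six codons
```

Even a three-residue peptide containing leucine and serine can thus be encoded by dozens of different nucleotide sequences, and the number grows multiplicatively with length.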

The polypeptide may be expressed in a modified form, such as a fusion protein, and may 
include not only secretion signals but also additional heterologous functional regions. Thus, 
for instance, a region of additional amino acids, particularly charged amino acids, may be 
added to the C- or N-terminus of the polypeptide to improve stability and persistence in the 
host cell, during purification or during subsequent handling and storage. Also, fusion proteins 
may be added to the polynucleotide or polypeptide to facilitate purification. Such regions 
may be removed prior to final preparation of the polynucleotide or polypeptide. The addition 
of peptide moieties to polynucleotide or polypeptides to engender secretion or excretion, to 
improve stability or to facilitate purification, among others, are familiar and routine 
techniques in the art. In drug discovery, for example, proteins have been fused with antibody 
Fc portions for the purpose of high-throughput screening assays to identify antagonists (see 
Bennett et al., 1995, and Johanson et al., 1995). 

Detecting Presence of or Predisposition for CAG Repeat Disorders 

This invention is also related to the use of the PDE10A polynucleotides to detect 
complementary polynucleotides as a diagnostic reagent. Detection of the level of expression 
of PDE10A in a eukaryote, particularly a mammal, and especially a human, will provide a 
method for diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly 
mammals, and especially humans, exhibiting decreased levels of PDE10A may be detected 
by a variety of techniques. Nucleic acids for diagnosis may be obtained from an affected 
individual's cells and tissues, such as the striatum, nucleus accumbens and olfactory 
tubercule. RNA may be used directly for detection or may be amplified enzymatically by 
using PCR (Saiki et al., 1986) prior to analysis. As an example, PCR primers complementary 
to the nucleic acid encoding PDE10A can be used to identify and analyze PDE10A presence 
and/or expression. Using PCR, characterization of the level of PDE10A present in the 
individual may be made by comparative analysis. 

The invention thus provides a process for detecting disease by using methods known in the 
art and methods described herein to detect decreased expression of PDE10 polynucleotide. 
For example, decreased expression of PDE10 polynucleotide can be measured using any one 
of the methods well known in the art for the quantification of polynucleotides, such as, for 
example, PCR, RT-PCR, DNAse protection, northern blotting and other hybridization 
methods. Thus, the present invention provides a method for detecting triplet-repeat disorders, 
and a method for detecting a genetic pre-disposition for triplet-repeat disorders and other 
disorders of the basal ganglia including schizophrenia, stroke, trauma, Parkinson's disease 
and Alzheimer's disease (AD). More generally, the present invention provides a method for 
detecting a genetic pre-disposition for neurological disorders characterized by progressive 
cell loss. 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of PDE10 polypeptides or polynucleotides, 
such as its interaction with PDE10-binding molecules. The identification of mutations in 
specific genes in inherited neurodegenerative disorders, combined with advances in the field 
of transgenic methods, provides those of skill in the art with the information necessary to 
further study human diseases. This is extraordinarily useful in modeling familial forms of 
triplet-repeat disorders and other disorders of the basal ganglia including schizophrenia, 
stroke, trauma, Parkinson's disease and Alzheimer's disease (AD). More generally, the 
present invention is useful for modeling neurological disorders characterized by progressive 
cell loss, as well as those involving acute cell loss, such as stroke and trauma. 

For example, to screen for agonists or antagonists, a synthetic reaction mix, a cellular 
compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof, 
may be prepared from a cell that expresses a molecule that binds PDE10. The preparation is 
incubated with labeled PDE10 in the absence or the presence of a candidate molecule which 
may be a PDE10 agonist or antagonist. The ability of the candidate molecule to bind the 
binding molecule is reflected in decreased binding of the labeled ligand. 

PDE10-like effects of potential agonists and antagonists may be measured, for instance, by 
determining activity of a reporter system following interaction of the candidate molecule with 
a cell or appropriate cell preparation, and comparing the effect with that of PDE10 or 
molecules that elicit the same effects as PDE10. Reporter systems that may be useful in this 
regard include, but are not limited to, colorimetric labeled substrate converted into product, a 
reporter gene that is responsive to changes in PDE10 activity, and binding assays known in 
the art. 

Another example of an assay for PDE10 antagonists is a competitive assay that combines 
PDE10 and a potential antagonist with membrane-bound PDE10-binding molecules, 
recombinant PDE10 binding molecules, natural substrates or ligands, or substrate or ligand 
mimetics, under appropriate conditions for a competitive inhibition assay. PDE10 can be 
labeled, such as by radioactivity or a colorimetric compound, such that the number of PDE10 
molecules bound to a binding molecule or converted to product can be determined accurately 
to assess the effectiveness of the potential antagonist. 
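
As an illustrative sketch only, the readout of such a competitive assay can be reduced to a percent inhibition figure by comparing the labeled signal bound in the presence and in the absence of the candidate. The function name, the optional correction for non-specific binding, and the counts used are assumptions made for the example and are not taken from the specification.

```python
def percent_inhibition(bound_with_candidate: float,
                       bound_without_candidate: float,
                       nonspecific: float = 0.0) -> float:
    """Percent reduction in specific binding of the labeled molecule.

    All three arguments are in the same arbitrary signal units (for example
    counts per minute); `nonspecific` is the background signal, assumed here
    to be measured separately.
    """
    specific_control = bound_without_candidate - nonspecific
    if specific_control <= 0:
        raise ValueError("control binding must exceed non-specific binding")
    specific_test = bound_with_candidate - nonspecific
    return 100.0 * (1.0 - specific_test / specific_control)


# Hypothetical counts: 1100 without candidate, 600 with candidate, 100 background.
print(percent_inhibition(600, 1100, nonspecific=100))
```

Repeating the measurement over a range of candidate concentrations would give a dose-response curve from which potency could be estimated; curve fitting is not shown here.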

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies 
that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or extinguish 
its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a 
binding molecule without inducing PDE10-induced activities, 
thereby preventing the action of PDE10 by excluding PDE10 from binding. 

Potential antagonists include a small molecule which binds to and occupies the binding site 
of the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, 1988, for a description of these molecules). 

Developing modulators of the biological activities of specific PDEs requires differentiating 
PDE isozymes present in a particular assay preparation. The classical enzymological 
approach of isolating PDEs from natural tissue sources and studying each new isozyme may 
be used. Another approach has been to identify assay conditions which might favor the 
contribution of one isozyme and minimize the contribution of others in a preparation. Still 
another approach has been the separation of PDEs by immunological means. Each of the 
foregoing approaches for differentiating PDE isozymes is time consuming. As a result many 
attempts to develop selective PDE modulators have been performed with preparations 
containing more than one isozyme. Moreover, PDE preparations from natural tissue sources 
are susceptible to limited proteolysis and may contain mixtures of active proteolytic products 
that have different kinetic, regulatory and physiological properties than the full length PDEs. 

Recombinant PDE10 polypeptide products of the invention greatly facilitate the development 
of new and specific PDE10 modulators. The need for purification of an isozyme can be 
avoided by expressing it recombinantly in a host cell that lacks endogenous 
phosphodiesterase activity (e.g., yeast strain YKS45 deposited as ATCC 74225). Once a 
compound that modulates the activity of the PDE10 is discovered, its selectivity can be 
evaluated by comparing its activity on the PDE10 to its activity on other PDE isozymes. 
Thus, the combination of the recombinant PDE10 products of the invention with other 
recombinant PDE products in a series of independent assays provides a system for developing 
selective modulators of PDE10. Selective modulators may include, for example, antibodies 
and other proteins or peptides which specifically bind to the PDE10 or PDE10 nucleic acid, 
oligonucleotides which specifically bind to the PDE10 (see Patent Cooperation Treaty 
International Publication No. WO93/05182 published Mar. 18, 1993 which describes 
methods for selecting oligonucleotides which selectively bind to target biomolecules) or 
PDE10 nucleic acid (e.g., antisense oligonucleotides) and other non-peptide natural or 
synthetic compounds which specifically bind to the PDE10 or PDE10 nucleic acid. Mutant 
forms of the PDE10 which alter the enzymatic activity of the PDE10 or its localization in a 
cell are also contemplated. Crystallization of recombinant PDE10 alone and bound to a 
modulator, analysis of atomic structure by X-ray crystallography, and computer modelling of 
those structures are methods useful for designing and optimizing non-peptide selective 
modulators. See, for example, Erickson et al., Ann. Rep. Med. Chem., 27: 271-289 (1992) for 
a general review of structure-based drug design. 
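
The comparison of a compound's activity on PDE10 with its activity on other PDE isozymes, as described above, may be summarized as a fold-selectivity ratio. The following sketch assumes that activity in each independent assay is expressed as an IC50 value; that choice of measure, the function name and the numerical values are all hypothetical.

```python
def fold_selectivity(ic50_by_isozyme: dict, target: str = "PDE10") -> dict:
    """Ratio of each off-target IC50 to the target IC50.

    A ratio above 1 means the compound is more potent against the target
    than against that isozyme. IC50 values must share the same units.
    """
    target_ic50 = ic50_by_isozyme[target]
    if target_ic50 <= 0:
        raise ValueError("target IC50 must be positive")
    return {
        isozyme: ic50 / target_ic50
        for isozyme, ic50 in ic50_by_isozyme.items()
        if isozyme != target
    }


# Hypothetical IC50 values (nM) from separate recombinant-isozyme assays.
example = {"PDE10": 10.0, "PDE2": 1000.0, "PDE5": 50.0}
print(fold_selectivity(example))
```

In this invented example the compound would be 100-fold selective over PDE2 but only 5-fold selective over PDE5, which shows why each isozyme is assayed independently.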

Targets for the development of selective modulators include, for example: (1) the regions of 
the PDE10 which contact other proteins and/or localize the PDE10 within a cell, (2) the 
regions of the PDE10 which bind substrate, (3) the allosteric cGMP-binding site(s) of 
PDE10, (4) the metal-binding regions of the PDE10, (5) the phosphorylation site(s) of PDE10 
and (6) the regions of the PDE10 which are involved in dimerization of PDE10 subunits. 

Thus, the present invention provides a method for screening and selecting compounds which 
promote triplet-repeat disorders, and a method for screening and selecting compounds which 
treat or inhibit triplet-repeat disorders, as well as schizophrenia, stroke, trauma, Parkinson's 
disease and Alzheimer's disease. More generally, the present invention provides a method for 
screening and selecting compounds which promote or inhibit neurological disorders 
characterized by progressive cell loss, as well as those involving acute cell loss, such as 

The selected antagonists and agonists may be administered, for instance, to inhibit 
progressive and acute neurological disorders, such as Huntington's disease, Parkinson's 
disease, schizophrenia, Alzheimer's disease (AD), stroke or trauma. 

Antagonists and agonists and other compounds of the present invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. The 
pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by direct microinjection into the affected area, or by 
intravenous or other routes. These compositions of the present invention may be employed in 
combination with a non-sterile or sterile carrier or carriers for use with cells, tissues or 
organisms, such as a pharmaceutical carrier suitable for administration to a subject. Such 
compositions comprise, for instance, a media additive or a therapeutically effective amount of 
antagonists or agonists of the invention and a pharmaceutically acceptable carrier or 
excipient. Such carriers may include, but are not limited to, saline, buffered saline, dextrose, 
water, glycerol, ethanol and combinations thereof. The formulation is prepared to suit the 
mode of administration. 

Inhibition of PDE10A will be highly detrimental to striatal brain function. The progressive 
decline in PDE10A mRNA levels in HD may lead to dysregulation of cAMP levels and 
neuronal dysfunction. Up-regulation of PDE10A will be effective in combating such 
neuronal dysfunction. 

A variety of gene therapy approaches may be used in accordance with the invention to 
modulate expression of the PDE10A gene in vivo. For example, antisense DNA molecules 
may be engineered and used to block translation of PDE10A mRNA in vivo. Alternatively, 
ribozyme molecules may be designed to cleave and destroy the PDE10A mRNAs in vivo. In 
another alternative, oligonucleotides designed to hybridize to the 5' region of the PDE10A 
gene (including the region upstream of the coding sequence) and form triple helix structures 
may be used to block or reduce transcription of the PDE10A gene. In yet another alternative, 
nucleic acid encoding the full length wild-type PDE10A message may be introduced in vivo 
into cells which otherwise would be unable to produce the wild-type PDE10A gene product 
in sufficient quantities or at all. 

In a preferred embodiment, the antisense, ribozyme and triple helix nucleotides are designed 
to inhibit the translation or transcription of PDE10A. To accomplish this, the 
oligonucleotides used should be designed on the basis of relevant sequences unique to 
PDE10A. 

For example, and not by way of limitation, the oligonucleotides should not fall within those 
regions where the nucleotide sequence of PDE10A is most homologous to that of other PDEs, 
such as PDE2, PDE5 and PDE6; the remaining regions are herein referred to as "unique regions". 

In the case of antisense molecules, it is preferred that the sequence be chosen from the unique 
regions. It is also preferred that the sequence be at least 18 nucleotides in length in order to 
achieve sufficiently strong annealing to the target mRNA sequence to prevent translation of 
the sequence. Izant and Weintraub, 1984, Cell, 36:1007-1015; Rosenberg et al., 1985, Nature, 
313:703-706. 
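
The two stated preferences for antisense molecules (selection from the unique regions and a length of at least 18 nucleotides) can be sketched as a simple enumeration. The sense sequence and the coordinates of the unique region below are invented for illustration and do not correspond to PDE10A; regions are assumed to be given as half-open (start, end) index pairs on the sense strand.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense: str) -> str:
    """Antisense DNA oligonucleotide (reverse complement) for a sense-strand segment."""
    return sense.upper().translate(COMPLEMENT)[::-1]


def antisense_candidates(sense: str, unique_regions, length: int = 18) -> list:
    """List (start, oligo) pairs for every window lying wholly inside a unique region."""
    candidates = []
    for region_start, region_end in unique_regions:
        for start in range(region_start, region_end - length + 1):
            segment = sense[start:start + length]
            candidates.append((start, antisense_oligo(segment)))
    return candidates


# Hypothetical 28-nucleotide sense sequence with one unique region, [0, 20).
SENSE = "ATGGCCAAGCTTGGATCCGAATTCAAGG"
for start, oligo in antisense_candidates(SENSE, [(0, 20)]):
    print(start, oligo)
```

Candidates produced in this way would still need to be screened experimentally for annealing strength and for their effect on translation.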



In the case of the "hammerhead" type of ribozymes, it is also preferred that the target 
sequences of the ribozymes be chosen from the unique regions. Ribozymes are RNA 
molecules which possess highly specific endoribonuclease activity. Hammerhead ribozymes 
comprise a hybridizing region which is complementary in nucleotide sequence to at least part 
of the target RNA, and a catalytic region which is adapted to cleave the target RNA. The 
hybridizing region contains nine (9) or more nucleotides. Therefore, the hammerhead 
ribozymes of the present invention have a hybridizing region which is complementary to the 
sequences listed above and is at least nine nucleotides in length. The construction and 
production of such ribozymes is well known in the art and is described more fully in Haseloff 
and Gerlach, 1988, Nature, 334:585-591. 

The ribozymes of the present invention also include RNA endoribonucleases (hereinafter 
"Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been extensively described by Thomas 
Cech and collaborators (Zaug, et al., 1984, Science, 224:574-578; Zaug and Cech, 1986, 
Science, 231:470-475; Zaug, et al., 1986, Nature, 324:429-433; published International patent 
application No. WO 88/04300 by University Patents Inc.; Been and Cech, 1986, Cell, 
47:207-216). The Cech endoribonucleases have an eight base pair active site which hybridizes to a 
target RNA sequence whereafter cleavage of the target RNA takes place. The invention 
encompasses those Cech-type ribozymes which target eight base-pair active site sequences 
that are present in PDE10A but not other PDEs. 

The foregoing compounds can be administered by a variety of methods which are known in 
the art including, but not limited to the use of liposomes as a delivery vehicle. Naked DNA or 
RNA molecules may also be used where they are in a form which is resistant to degradation 
such as by modification of the ends, by the formation of circular molecules, or by the use of 
alternate bonds including phosphothionate and thiophosphoryl modified bonds. In addition, 
the delivery of nucleic acid may be by facilitated transport where the nucleic acid molecules 
are conjugated to poly-lysine or transferrin. Nucleic acid may also be transported into cells by 
any of the various viral carriers, including but not limited to, retrovirus, vaccinia, AAV, and 
adenovirus. 



Alternatively, a recombinant nucleic acid molecule which encodes, or is, such antisense, 
ribozyme, triple helix, or PDE10A molecule can be constructed. This nucleic acid molecule 
may be either RNA or DNA. If the nucleic acid encodes an RNA, it is preferred that the 
sequence be operatively attached to a regulatory element so that sufficient copies of the 
desired RNA product are produced. The regulatory element may permit either constitutive or 
regulated transcription of the sequence. In vivo, that is, within the cell or cells of an 
organism, a transfer vector such as a bacterial plasmid or viral RNA or DNA, encoding one or 
more of the RNAs, may be transfected into cells (e.g., Llewellyn et al., 1987, J. Mol. Biol., 
195:115-123; Hanahan et al., 1983, J. Mol. Biol., 166:557-580). Once inside the cell, the 
transfer vector may replicate, and be transcribed by cellular polymerases to produce the RNA 
or it may be integrated into the genome of the host cell. Alternatively, a transfer vector 
containing sequences encoding one or more of the RNAs may be transfected into cells or 
introduced into cells by way of micromanipulation techniques such as microinjection, such 
that the transfer vector or a part thereof becomes integrated into the genome of the host cell. 



Composition, Formulation, and Administration of Pharmaceutical Compositions 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is itself known, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present invention thus may be 
formulated in conventional manner using one or more physiologically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds 
into preparations which can be used pharmaceutically. Proper formulation is dependent upon 
the route of administration chosen. 

For injection, the agents of the invention may be formulated in aqueous solutions, preferably 
in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer. For transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. 

For oral administration, the compounds can be formulated readily by combining the active 
compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained by combining the active 
compounds with a solid excipient, optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee 
cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, 
sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, 
wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, 
hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone 
(PVP). If desired, disintegrating agents may be added, such as the cross-linked polyvinyl 
pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar 
solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, 
carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or 
dragee coatings for identification or to characterize different combinations of active 
compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such 
as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended 
in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In 
addition, stabilizers may be added. All formulations for oral administration should be in 
dosages suitable for such administration. 



For buccal administration, the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention 
are conveniently delivered in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in an inhaler or 
insufflator may be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit dosage 
form, e.g., in ampoules or in multidose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of the 
active compounds in water-soluble form. Additionally, suspensions of the active compounds 
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl 
oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, 
sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents 
which increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

Alternatively, the active ingredient may be in powder form for constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. 

In addition to the formulations described previously, the compounds may also be formulated 
as a depot preparation. Such long acting formulations may be administered by implantation 
(for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for 
example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a cosolvent 
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, 
and an aqueous phase. Naturally, the proportions of a co-solvent system may be varied 
considerably without destroying its solubility and toxicity characteristics. Furthermore, the 
identity of the co-solvent components may be varied. 

Alternatively, other delivery systems for hydrophobic pharmaceutical compounds may be 
employed. Liposomes and emulsions are well known examples of delivery vehicles or 
carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also may 
be employed, although usually at the cost of greater toxicity. Additionally, the compounds 
may be delivered using a sustained-release system, such as semipermeable matrices of solid 
hydrophobic polymers containing the therapeutic agent. Various sustained-release 
materials have been established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical nature, release the compounds 
for a few weeks up to over 100 days. Depending on the chemical nature and the biological 
stability of 
the therapeutic reagent, additional strategies for protein stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or 
excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. 

Many of the compounds of the invention may be provided as salts with pharmaceutically 
compatible counterions. Pharmaceutically compatible salts may be formed with many acids, 
including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc. 
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 
free base forms. 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, 
transdermal, or intestinal administration; parenteral delivery, including intramuscular, 
subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into an affected area, often in a depot or 
sustained release formulation. 

Furthermore, one may administer the drug in a targeted drug delivery system, for example, in 
a liposome coated with an antibody specific for affected cells. The liposomes will be targeted 
to and taken up selectively by the cells. 

The pharmaceutical compositions generally are administered in an amount effective for 
treatment or prophylaxis of a specific indication or indications. It is appreciated that 
optimum dosage will be determined by standard methods for each treatment modality and 
indication, taking into account the indication, its severity, route of administration, 
complicating conditions and the like. In therapy or as a prophylactic, the active agent may be 
administered to an individual as an injectable composition, for example as a sterile aqueous 
dispersion, preferably isotonic. A therapeutically effective dose further refers to that amount 
of the compound sufficient to result in amelioration of symptoms associated with such 
disorders. Techniques for formulation and administration of the compounds of the instant 
application may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., 
Easton, Pa., latest edition. For administration to mammals, and particularly humans, it is 
expected that the daily dosage level of the active agent will be from 0.001 mg/kg to 10 
mg/kg, typically around 0.01 mg/kg. The physician in any event will determine the actual 
dosage which will be most suitable for an individual and will vary with the age, weight and 
response of the particular individual. The above dosages are exemplary of the average case. 
There can, of course, be individual instances where higher or lower dosage ranges are 
merited, and such are within the scope of this invention. 
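
Purely as an arithmetic illustration of the per-kilogram range stated above, and not as 
dosing guidance, the conversion from mg/kg to a total daily amount can be sketched as 
follows. The 70 kg body weight and the function name are assumptions introduced for the 
example only. 

```python
def daily_dose_mg(weight_kg, dose_mg_per_kg):
    """Convert a per-kilogram daily dose into a total daily dose in mg."""
    if weight_kg <= 0 or dose_mg_per_kg < 0:
        raise ValueError("weight must be positive and dose non-negative")
    return weight_kg * dose_mg_per_kg


# Per-kilogram levels quoted in the text (mg/kg per day).
LEVELS = {"low": 0.001, "typical": 0.01, "high": 10.0}

weight_kg = 70.0  # hypothetical body weight, for illustration only
doses = {name: daily_dose_mg(weight_kg, level) for name, level in LEVELS.items()}
for name, dose in doses.items():
    print(f"{name}: {dose:.2f} mg/day")
```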

The invention further provides diagnostic and pharmaceutical packs and kits comprising one 
or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. Associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, reflecting approval by the agency of the manufacture, 
use or sale of the product for human administration. 

EXAMPLES 

The present invention is further described by the following examples. These examples, 
while illustrating certain specific aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 

Wild-type (B6CBAF1) and HD transgenic [B6CBA-TgN(HDexon1)62Gpb] mice (Jackson 
Laboratories) and adult Sprague-Dawley rats (250-300 g; Charles River Laboratories) 
were used in this study. The genotype of the mice was determined by PCR amplification of a 
100 bp region of the integrated human HD exon 1 transgene using primers corresponding to 
nts 3340-3459 (5'-AGG GCT GTC AAT CAT GCT GG-3') and nts 3836-3855 (5'-AAA 
CTC ACG GTC GGT GCA GC-3') of clone E4.1 of the human HD gene (Accession number 
L34020). PCR conditions used are described in Mangiarini et al. (1996). DNA was extracted 
from a tail clip and an ear punch from each mouse used in this study. Both samples were 
subjected to PCR genotype analysis. For in situ hybridization analysis, the animals were 
anesthetized with >100 mg/kg sodium pentobarbital and decapitated, and the brains were 
removed and stored at -70°C prior to sectioning. For RNA isolation, animals were anesthetized 
and decapitated, and the striatum and cortex were excised and stored in liquid nitrogen prior 
to RNA extraction. Animal care was given according to protocols approved by Dalhousie 
University and the Canadian Council of Animal Care. 
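
As a rough check on the two genotyping primers quoted above, their length, GC content and an 
approximate melting temperature can be computed as sketched below. The Wallace rule 
(2°C per A or T, 4°C per G or C) is a rule of thumb chosen here for illustration; it is an 
assumption of this sketch and not the method used in the study, and the function names are 
hypothetical. 

```python
def clean(primer):
    """Strip the spacing used in the printed sequence and upper-case it."""
    return primer.replace(" ", "").upper()


def gc_fraction(primer):
    """Fraction of bases that are G or C."""
    seq = clean(primer)
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer):
    """Approximate melting temperature in deg C by the Wallace rule."""
    seq = clean(primer)
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


FORWARD = "AGG GCT GTC AAT CAT GCT GG"  # primer sequences as printed in the text
REVERSE = "AAA CTC ACG GTC GGT GCA GC"

for name, primer in (("forward", FORWARD), ("reverse", REVERSE)):
    print(name, len(clean(primer)), gc_fraction(primer), wallace_tm(primer))
```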

Differential display was used to identify novel mRNA or previously described mRNA whose 
relative expression levels are altered as a result of the presence of the transgene. Using 
differential display, the mRNA populations derived from the striatum of 10 week-old wild-type 
mice were compared with those of age-matched R6/2 transgenic mice. Differential display has 
been used extensively (> 750 references) since its development (Liang and Pardee, 1992) to 
identify changes in gene expression in cells and in tissues including brain (Douglass et al., 
1995; Babity et al., 1997a; Livesey et al., 1997; Berke et al., 1998). Perhaps the most 
important finding was the demonstration by Qu et al. (1996) that differential display can be 
used to isolate genes differentially expressed in inbred strains of mice. The power of 
differential display is that the sequence information obtained can be directly related to the 
experimental paradigm. Moreover, such sequence information is sufficient to identify 
transcripts and can then lead to experiments that reveal the function of the 
cognate protein in the experimental model. 

DNA sequence information of potentially differentially expressed cDNA can be used to 
generate oligonucleotide probes for in situ hybridization to define the anatomical and 
temporal patterns of expression of specific transcripts (see Babity et al., 1997a). This 
technique is especially useful to study changes in steady-state levels of mRNA in 
heterogeneous tissue such as brain. Brain tissue can be micro-dissected (Babity et al., 
1997b). This enabled the present inventors to reduce the requirement for tissue, and hence 
to compare the mRNA populations derived from individual animals for each experimental 
group. 

Thus RT-PCR (Denovan-Wright et al., 1999) was used to identify differences in the patterns 
of gene expression between the striatum of wild-type and transgenic mice that were 
hemizygous for the 5' UTR, exon 1 and part of intron 1 of the human Huntington's Disease 
gene. Total cellular RNA was isolated from the striatum and cortex of three 10 week-old 
wild-type and three 10 week-old R6/2 HD mice (Mangiarini et al., 1996) and used as the 
template to generate single-stranded cDNA. Total cellular RNA from each animal and tissue 
was purified using Trizol™ reagent (Gibco BRL) and the manufacturer's protocol. 10 µg 
aliquots of total RNA were treated with RQ1 RNase-free DNase (Promega) in the presence 
of RNasin™ (Promega) RNase inhibitor to remove trace genomic DNA and then converted 
to single-stranded cDNA. The primers and conditions for PCR amplification follow those of 
the Delta™ RNA fingerprinting manual (Clontech). 

The cDNA was then used as the substrate for PCR reactions using 57 differential display 
primer combinations. The radio-labelled PCR products were fractionated on denaturing 
acrylamide sequencing gels using a Genomyx LR™ sequencing apparatus, transferred to 
3MM filter paper and dried. The dried acrylamide gels were exposed to autoradiography film 
(BioMax MR™) overnight. After fractionating the radio-labelled PCR products on denaturing 
acrylamide gels, it was found that the overwhelming majority of the approximately 18,000 
PCR products screened were common to both the wild-type and HD mice (data not shown). 
One PCR product, amplified using the primers P7 (5'-ATT AAG GGT GAG TAA ATG CTG 
TAT G-3') and T6 (5'-GAT TAT GGT GAG TGA TAT GTT TTT TTT TGG-3') of 
approximately 500 bp, was observed in each of three samples derived from the striatum of 
wild-type mice (FIG. 1). This 500 bp band was absent from the samples derived from the 
striatum of the HD mice (FIG. 1) and was absent from each of the samples derived from the 
cortical tissue (data not shown). 

FIG. 1 shows the Down-regulated in Huntington's Disease (PDE10A) transcript, identified 
by differential display RT-PCR. A band of approximately 500 bp (arrow) was amplified from 
cDNA made from 10 week-old wild-type but not 10 week-old HD striatal tissue. Total RNA 
from individual animals (numbered 1-6) was used as the substrate for the generation of 
single-stranded cDNA. Animals 1, 2 and 3 were transgenic HD mice. Animals 4, 5 and 6 
were wild-type mice. 

EXAMPLE 2 - Cloning of PDE10A 

The 500 bp band, designated PDE10Apcr, was excised from the dried gel and rehydrated in 40 
µl of H2O for 10 min at room temperature. The eluted DNA was subjected to PCR 
re-amplification using the P7 and T6 primers, rTaq polymerase (Pharmacia) and the following 
conditions: 60" @ 94°C, 19 x (30" @ 94°C, 30" @ 58°C, 120" @ 68°C + 4" per cycle), 7' @ 
68°C. The PCR reaction was subjected to agarose gel electrophoresis and the 500 bp band 
was removed from the gel, extracted from the agarose using the Qiagen gel extraction 
protocol and cloned into the vector pGem-T using standard methods. Plasmid DNA was 
isolated from selected transformants using Qiagen spin columns. The resultant clone was 
named pPDE10A. 
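
For orientation, the programmed hold time of the re-amplification described above (a 
60-second initial denaturation followed by 19 cycles of 30 s, 30 s and 120 s, with the 
extension lengthened by 4 s per cycle) can be tallied as sketched below. The sketch assumes 
that the first cycle uses the unextended 120 s step and that each later cycle adds a 
further 4 s; it ignores ramp times, and the final hold is left as a parameter. The function 
name is hypothetical. 

```python
def program_seconds(initial_s=60, cycles=19, denature_s=30, anneal_s=30,
                    extend_s=120, extend_increment_s=4, final_hold_s=0):
    """Total programmed hold time in seconds, excluding ramp times."""
    total = initial_s
    for cycle in range(cycles):
        # Assumption: cycle 0 uses the base extension time, and each
        # subsequent cycle is lengthened by extend_increment_s.
        total += denature_s + anneal_s + extend_s + extend_increment_s * cycle
    return total + final_hold_s


cycled = program_seconds()
print(cycled, "s =", round(cycled / 60, 1), "min before any final hold")
```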

EXAMPLE 3 - Identification of PDE10A 

The cloned insert of pPDE10A was radio-labelled and used as a hybridization probe in 
northern blot analysis (FIG. 2). Northern blots of total RNA were prepared using the method 
described in Denovan-Wright et al. (1998). The 500 bp cloned insert of PDE10A was 
radio-labelled with [α-32P]dCTP (3000 Ci/mmol) using the Ready-to-Go dCTP beads (Pharmacia). 
Northern blot hybridization, brain tissue preparation and in situ hybridization are described in 
Denovan-Wright et al. (1998). The 500 bp cloned insert of pPDE10A annealed to a transcript 
of approximately 9.5 kb in total RNA isolated from the striatum of ten week-old wild-type 
mice. 

FIG. 2 demonstrates that PDE10A is expressed in the striatum but not the cortex of wild-type 
mice and that the steady-state levels of PDE10A are reduced in 10 week-old transgenic HD mice. 
The differential expression of PDE10A in HD mice was confirmed by northern blot analysis. 
The cloned insert of pPDE10A was radio-labelled and used as a hybridization probe in 
northern blot analysis. The northern blot was prepared by size-fractionating total RNA from 
the striatum and cortex of three individual 10 week-old HD (1, 2 and 3) and wild-type (4, 5 
and 6) mice. Following the hybridization of pPDE10A, the radio-label was removed and the 
blot was subsequently allowed to hybridize with a probe that detects constitutively expressed 
cyclophilin. The hybridization pattern of the cyclophilin probe is aligned below the northern 
blot, demonstrating that equivalent amounts of RNA were present in each lane. The relative 
mobility of RNA molecular weight standards (RNA ladder, Gibco BRL) is shown on the 
left of the northern blot. 

The hybridization signal of pPDE10A was significantly lower in the RNA samples derived 
from the striatum of 10 week-old HD mice. No expression of the PDE10A mRNA was 
detected in the cortical RNA samples derived from either the wild-type or HD mice. 

EXAMPLE 4 - Sequencing PDE10A 

The sequence of the cloned differential display band, pPDE10A, was determined using M13 
universal forward and reverse sequencing primers and the T7 sequencing kit (Pharmacia). 
The 484 bp cDNA fragment did not have sequence similarity to any Genbank entries. 

FIG. 3 shows the nucleotide sequence of the cloned PDE10A differential display product, 
pPDE10A. The positions of the primers used to amplify the fragment are underlined and 
labelled. The nucleotide sequence and position of oligonucleotide probes 1 and 2 within the 
pPDE10A sequence are shown. 

EXAMPLE 5 - Isolation and Characterization of cDNA PDE10A 

In order to isolate PDE10A cDNA clones, oligonucleotide probes 1 and 2 were used in 5' and 
3' Rapid Amplification of cDNA Ends (RACE) reactions using commercially prepared 
RACE-ready mouse striatal cDNA (Clontech). Several independent clones were isolated and 
those that contained the sequence of pPDE10A were selected for further analysis. Each of the 
5' RACE clones was identical in sequence over the length that the clones could be aligned. 
The difference in length between these clones is a result of termination of the original 
reverse-transcriptase reaction at different positions along the mRNA. No difference in size or 
sequence was detected between several 3' RACE clones. The longest 5' RACE clone and 
one 3' RACE clone were completely sequenced using internal primers. The present inventors 
were able to isolate a very short clone that extended the 5' RACE clone using an internal 
primer (probe 3, 5'-CTA TTT CAC AAG AGA CTG ACC AGC CAA TAA ATC TC-3'). 
The compiled sequence of the first PDE10A cDNA clone, named cPDE10A-1, is presented in 
FIG. 10. cPDE10A-1 is 3235 bp in length. The restriction map of cPDE10A-1 is shown in 
FIG. 11. 

The mRNA that hybridized with pPDE10A was approximately 9.5 kilobases in length. In 
order to obtain a PDE10A cDNA clone that was larger than cPDE10-1, the present inventors 
screened a mouse brain cDNA library. Several clones were identified that hybridized with 
the pPDE10 probe. The sequence of the largest of these cDNA clones, cPDE10-2, was 
determined. The sequence (FIG. 12) was 5753 base pairs in length. The restriction map of 
cPDE10-2 is shown in FIG. 13. 

cPDE10-1 and cPDE10-2 share sequence identity over 2095 bp. However, the 5' 1142 bp of 
cPDE10-1 and the 5' 1689 bp of cPDE10-2 are unique to each clone. Clone cPDE10-2 
extends 1969 bp in the 3' direction compared to cPDE10-1. A schematic showing the regions 
of sequence identity and the unique sequences of cPDE10-1 and -2 is shown in FIG. 14. 

The compiled sequence of the mouse PDE10 cDNA clone, named cPDE10A, is presented in 
FIG. 15 with RACEs. A further sequence, without RACEs, is shown in FIG. 19. The coding 
sequence and restriction map of cPDE10A is shown in FIG. 16, and updated at FIG. 17. FIG. 
18 is a restriction map of PDE10A. The coding region has a met initiator commencing at 
nucleotide 257, with a stop codon ending at nucleotide 2596. 
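
The size of the open reading frame implied by these coordinates follows from simple 
arithmetic, sketched below under the assumption that both nucleotide positions are 
1-based and inclusive. The function name is hypothetical. 

```python
def orf_summary(start_nt, stop_end_nt):
    """Return (nucleotides, codons, amino acids) for an inclusive ORF span."""
    length = stop_end_nt - start_nt + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = length // 3
    # The final codon is the stop codon and encodes no amino acid.
    return length, codons, codons - 1


nucleotides, codons, residues = orf_summary(257, 2596)
print(nucleotides, codons, residues)  # 2340 780 779
```

On this reading the coding region spans 2340 nucleotides, that is, 780 codons including the 
stop codon, corresponding to a predicted protein of 779 amino acids. 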

PDE10A was found to have extremely high homology with human PDE10s identified by 
Loughney et al., WO99/42596, the contents of which are incorporated herein by reference. 

EXAMPLE 6 - Localization of PDE10A in the Brain 

In order to identify the coding strand and to localize the transcript in the wild-type mouse 
brain, two oligonucleotide probes were designed (probe 1, 5'-GAA CAT GTA GCA TAT 
ACT CCA GAC AAC AGA TCA TAT GG-3'; probe 2, 5'-CAG CTT CTC CAC AGG 
AAC ACA GTA ACA AAG AG-3') that were complementary to different regions and 
strands of the 484 bp pPDE10A clone. These oligonucleotides were used for in situ 
hybridization analysis. Using high stringency post in situ hybridization washes (2 x 30' in 1X 
SSC @ 58°C, 4 x 15' in 1X SSC @ 58°C, 4 x 15' in 0.5X SSC @ 58°C, 4 x 15' in 0.25X SSC 
@ 58°C), it was found that oligonucleotide probe 1 annealed with mRNA in the striatum, 
nucleus accumbens and olfactory tubercle of ten week-old wild-type mice (FIG. 4). The 
hybridization signal was significantly reduced in the striatum, nucleus accumbens and 
olfactory tubercle of the 10 week-old HD mice (FIG. 4). 

FIG. 4 shows in situ hybridization of probe 1 to coronal (top three sections) and sagittal 
(bottom section) 10 week-old wild-type (WT) and HD mouse brain sections. Specific 
hybridization of the probe was observed in the striatum, nucleus accumbens and olfactory 
tubercle of wild-type mice. The top three sections represent the distribution of PDE10A 
throughout the rostral-caudal axis of the striatum. 

The in situ hybridization results confirmed the northern blot analysis, demonstrating 1) that 
the expression of PDE10A mRNA was restricted to the striatum, nucleus accumbens and 
olfactory tubercle and 2) that the levels of PDE10A mRNA were decreased in HD mice 
compared to the wild-type. The probe did not anneal with mRNA in any other brain nuclei. 
No hybridization of oligonucleotide probe 2 was observed in any region of the brain in 
wild-type or HD mice (Fig. 3). Based on this hybridization, the coding strand, complementary 
to probe 1, of pPDE10A was defined. 
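
The strand logic used above, in which only a probe that is the reverse complement of the 
transcript can anneal to it, can be sketched as follows. The toy transcript and probes are 
hypothetical, the transcript is written in the DNA alphabet for simplicity, and exact 
matching stands in for hybridization under stringent conditions. 

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (spaces ignored)."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


def can_anneal(probe, transcript):
    """True if the probe is exactly complementary to part of the transcript."""
    return reverse_complement(probe) in transcript.replace(" ", "").upper()


# Toy sense-strand sequence for illustration only; not a real PDE sequence.
transcript = "ATGGCCATTGTAATGGGCCGC"
antisense_probe = reverse_complement("GCCATTGTAATG")  # complementary to the transcript
sense_probe = "GCCATTGTAATG"                          # identical to the transcript

print(can_anneal(antisense_probe, transcript))
print(can_anneal(sense_probe, transcript))
```

A probe pair designed against opposite strands therefore gives one signal-producing probe 
and one built-in negative control, which is how the coding strand was assigned. 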

The in situ hybridization using oligonucleotide probe 1 demonstrated that PDE10A mRNA 
levels in the striatum, nucleus accumbens and olfactory tubercle were decreased in ten 
week-old HD mice. By ten weeks of age, the HD mice all showed motor symptoms 
including resting tremor and stereotypic involuntary movements. Moreover, these mice 
immediately clasped their feet together and curled into a tight ball when picked up by their 
tails. 

As the phenotypic signs are progressive over a number of weeks, the present inventors 
examined whether the PDE10A transcript was ever expressed in the striatum of the HD mice 
or whether the steady-state levels of the transcript diminished in the striatum in a course that 
paralleled the development of the motor disorders. Wild-type and HD mice were sacrificed 
at 5, 7 and 8 weeks of age and their brains were prepared for in situ hybridization analysis 
using probe 1 (FIG. 5). 

FIG. 5 shows that the levels of PDE10A mRNA decrease in HD mice over the period of time that 
the HD mice develop abnormal movements and postures. In situ hybridization analysis of 
coronal and sagittal sections of wild-type and HD mouse brain was performed using 
oligonucleotide probe 1, which is complementary to the coding strand of PDE10A. At 5 weeks 
of age, before the development of motor symptoms, the HD mice express the PDE10A transcript 
in the same brain nuclei and at the same relative levels as wild-type mice. The steady-state 
level of PDE10A decreases in the striatum, nucleus accumbens and olfactory tubercle from 5 
to 10 weeks in the HD but not wild-type mice. By 9 weeks of age, the HD mice have abnormal 
movement and posture. The numbers refer to the age in weeks of the wild-type (WT) and 
Huntington's (HD) transgenic mice. 



None of the mice at these ages had overt motor symptoms. Sections taken throughout the 
rostral-caudal axis of the striatum showed that PDE10A was expressed in the 5 week-old 
wild-type and HD mice. The relative hybridization of probe 1 did not change in 5, 7, 8 and 
10 week-old wild-type mice. The intensity of the hybridization signal appeared to decrease in 
the striatum, nucleus accumbens and olfactory tubercle of HD mice from 5 to 10 weeks 
compared to their wild-type litter mates (FIG. 5). 

The levels of PDE10A were significantly reduced by 8 weeks of age in the HD mice, using 
two in situ oligonucleotide probes, one complementary to the 3' UTR, the second 
complementary to an internal portion of the coding region. The hybridization pattern 
observed in the wild-type and HD mice was the same for both the probes employed. This 
analysis demonstrated that there is a reduction in the complete PDE10A mRNA levels during 
the development of the HD phenotype and not that there was a differential reduction in the 
PDE10A coding region as compared to the extensive 3' UTR. Moreover, in situ hybridization 
using the PDE10A-specific probe against neurologically normal and HD human brain tissue 
demonstrated that there was a decrease in PDE10A levels in human HD patients. 

One day-old wild-type and HD mice were frozen, sectioned on a cryostat and whole mouse 
sections were prepared for in situ hybridization using probe 1. The same high stringency 
post-hybridization washing conditions were employed for the one day-old mouse body 
sections as were used for the adult mouse brain sections. Parallel in situ hybridization 
experiments using probe 2 were performed in order to determine the level of non-specific 
signal in the mouse sections. Probe 1 specifically annealed to the developing striatum (FIG. 

FIG. 6 demonstrates that PDE10A is expressed in the developing striatum of one day-old 
wild-type and HD mice. The sections on the left were subjected to in situ hybridization using 
probe 1. Following hybridization, the sections were counter-stained with cresyl violet to 
visualize the mouse organs. The signal outside the brain was non-specific as probe 2 and 
other unrelated control oligonucleotide probes all labelled these tissues. 

There was no difference in the pattern of hybridization between the one day-old wild-type 
and HD mice, demonstrating that PDE10A was expressed in the developing brain of both 
wild-type and HD mice. 

Following in situ hybridization, the sections were covered in autoradiographic emulsion, left 
in the dark to expose for 4 weeks and then developed and viewed under dark-field 
microscopy or, after counter-staining the sections with cresyl violet to visualize neuronal cell 
bodies, under bright-field microscopy. Silver grains were observed to be concentrated in the 
striatum of the wild-type mice. FIG. 7 shows emulsion autoradiography of mouse brain 
sections following in situ hybridization of probe 1, demonstrating that the PDE10A transcript 
is expressed in neurons. PDE10A is not homogeneously distributed throughout the mouse 
striatum. Dark-field illumination of the sections after emulsion autoradiography showed that 
the silver grains were clustered in specific regions of the 10 week-old wild-type mouse 
striatum (A and C). Sections from 10 week-old HD mice subjected to identical in situ and 
emulsion autoradiographic conditions are shown in B and D. The photomicrographs shown 
in A and B were viewed using the 10X objective (bar represents 100 µm). The micrographs 
shown in C and D were viewed under the 20X objective (bar represents 25 µm). The insert 
in panel C is a portion of the section in A and C counter-stained with cresyl violet to visualize 
the neurons, viewed using the 40X objective under bright-field illumination. Note the 
distribution of the silver grains over some, but not all, of the striatal neurons as well as being 
concentrated around clusters of neurons. It appeared that the silver grains were absent from 
fibre tracts within the striatum. It appeared that PDE10A mRNA was not confined to regions 
close to the nucleus but was dispersed in cellular processes. 

Huntingtin with an expanded polyglutamine tract (htt-HD) is expressed in neurons of the
brain and body throughout development and during the lifetime of HD patients (The
Huntington's Disease Research Collaborative, 1993; Ross, 1995). Transgenic HD mice
express a portion of htt-HD and develop a phenotype with many of the symptoms of HD after
a period of normal development and growth (Carter et al., 1999; Cha et al., 1998; Mangiarini
et al., 1996). Using differential display RT-PCR, northern blot and in situ hybridization, we
have demonstrated that PDE10A mRNA levels decline in the striatum of HD mice. This
specific member of the PDE multigene family is highly expressed in the striatum and
olfactory tubercle of mice (Soderling et al., 1999) and rats (Fujishige et al., 1999) and in the
caudate and putamen of humans (Fujishige et al., 1999; Loughney et al., 1999). The levels of
PDE10A were the same in 5 week old wild-type and HD mice. PDE10A mRNA levels then
began to decline and were almost undetectable in the striatum and olfactory tubercle by the
time the mice reached 8 weeks of age. This time coincides with the onset of overt motor
symptoms in the HD mice, indicating that the loss of PDE10A in striatal neurons leads to
dysfunction of the nuclei that control movement. The R6/2 mice develop the HD phenotype
in the absence of cell death. The decrease in PDE10A mRNA, therefore, is not due to the
loss of PDE10A-expressing cells but rather to a change in steady-state RNA levels that occurs
due to the expression of mutant huntingtin.

The particular isoform that decreases in HD is PDE10A. PDE10A has been cloned from
human lung and fetal brain cDNA libraries (Fujishige et al., 1999; Loughney et al., 1999). It
appears that the presence of the expanded polyglutamine tract in huntingtin alters gene
expression in the striatum, and that this is the mechanism by which only a small group of
neurons in the striatum and cortex are rendered vulnerable to this ubiquitously expressed
mutant protein.

EXAMPLE 8 - PDE10A is Highly Conserved Among Mammalian Species

The oligonucleotide (probe 1) complementary to the coding strand of the PDE10A transcript
was also used as an in situ hybridization probe against coronal brain sections derived from
adult rats. FIG. 8 shows that in situ hybridization analysis of adult rat brain sections using
oligonucleotide probe 1 complementary to the coding strand of PDE10A revealed that the
pattern of expression of PDE10A is the same in rats and mice. The hybridization conditions
used to detect the rat homologue of PDE10A in rat brain tissue differed from those used to
detect the transcript in mice only in that the stringency of the post-hybridization washes was
reduced.
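
Because probe 1 is described as complementary to the coding strand, an antisense oligonucleotide of this kind can in principle be derived from any stretch of coding-strand sequence by taking its reverse complement. The following is a minimal sketch in Python. The helper name `reverse_complement` and the input fragment are assumptions made for illustration; the fragment is not the sequence of probe 1.

```python
# Hypothetical helper: derive an antisense oligonucleotide from coding-strand DNA.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    try:
        return "".join(COMPLEMENT[base] for base in reversed(seq))
    except KeyError as err:
        raise ValueError(f"unexpected base: {err.args[0]!r}") from None

# Illustrative coding-strand fragment (an assumption, not probe 1 itself).
coding = "ATGGCATTC"
probe = reverse_complement(coding)
print(probe)
```

Applying the function twice returns the original sequence, which is a quick sanity check on any probe design of this type.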

No hybridization was observed in the rat striatum using the post-hybridization washes
employed following the in situ hybridization of mouse brain sections. However, when the
stringency of the post-hybridization washes was lowered (2 x 60' in 1X SSC @ 42°C, 2 x 60'
in 0.5X SSC @ 42°C, 2 x 60' in 0.25X SSC @ room temperature), the PDE10A
oligonucleotide probe specifically labelled the adult rat striatum, nucleus accumbens and
olfactory tubercle in a pattern indistinguishable from that observed in mouse brain sections.
It appears, therefore, that a transcript which shares nucleotide sequence and expression
pattern is present in both mice and rats. The evolutionary conservation of PDE10A suggests
that it is important for normal function of the basal ganglia.
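
The washes listed above step down in SSC strength, and therefore in sodium ion concentration. The sketch below tabulates the Na+ concentration at each step. It assumes the standard SSC recipe (1X SSC = 150 mM NaCl plus 15 mM trisodium citrate), which is not stated in this document, and the function name `sodium_mm` is hypothetical.

```python
# Assumed standard recipe: 1X SSC = 150 mM NaCl + 15 mM trisodium citrate.
NACL_MM_AT_1X = 150.0
CITRATE_MM_AT_1X = 15.0
NA_PER_CITRATE = 3  # trisodium citrate contributes three Na+ per molecule

def sodium_mm(ssc_fold: float) -> float:
    """Total Na+ concentration (mM) of an SSC solution at the given fold strength."""
    return ssc_fold * (NACL_MM_AT_1X + NA_PER_CITRATE * CITRATE_MM_AT_1X)

# The three wash steps listed for the rat sections.
for fold in (1.0, 0.5, 0.25):
    print(f"{fold}X SSC: {sodium_mm(fold):.2f} mM Na+")
```

Stringency depends on both salt concentration and temperature, so these values are only one part of any comparison with the mouse wash conditions.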

By northern blot, Fujishige et al. (1999) demonstrated that PDE10A is expressed in human
fetal brain. The homology between mouse and human PDE10A is extremely high (data not
shown).

EXAMPLE 9 - Analysis of PDE10A in Genomic DNA

Because the transgenic mice employed in this study have a copy of the human HD 5' UTR,
exon 1 with an expanded CAG repeat and 262 bp of intron 1 integrated into an
undefined locus of the mouse genome, it was possible that the integration event disrupted the
PDE10A gene, preventing its expression in the HD mouse striatum.

Genomic DNA was isolated from wild-type and HD mice and subjected to Southern blot
analysis using pPDE10A as a hybridization probe. The sizes of the BamHI and EcoRI
fragments that are present in the transgenic R6/2 line and correspond to the insertion of the
human exon 1 gene fragment are 1.9 and 0.8 kb (BamHI) and 1.9 kb (EcoRI). Analysis of the
sizes of the fragments that hybridized with pPDE10A demonstrated that there was no
difference in the size of the hybridizing fragments between the wild-type and HD mice. FIG.
9 shows that the genomic DNA restriction fragments that hybridized with pPDE10A were the
same in wild-type and HD mice. The sizes of the hybridizing BamHI and EcoRI fragments in
each genomic DNA sample are approximately 8 kb and 3 kb, respectively. If the 1.9 kb
SacI-EcoRI HD gene fragment had integrated into the genome within the BamHI and EcoRI
fragments that hybridized with the DHDM cDNA cloned insert, the sizes of the HD hybridizing
bands would have been distinct from those of the wild-type. This Southern blot analysis indicates
that the gene encoding PDE10A is present as a single copy in the mouse genome. The
numbers at the left of the blot are the relative mobility of molecular weight markers (1 kb
ladder, BioRad).

The PDE10A cDNA has since been cloned using a bioinformatics search strategy involving
screening of the expressed sequence tag (EST) database for novel PDE cDNA clones.
Independently, the mouse PDE10A cDNA was identified after an EST search for novel PDEs
with conserved cGMP binding domains (Soderling et al., 1999). The rat isoforms of
PDE10A and splice variants have also been described (Fujishige et al., 1999). Human,
mouse and rat PDE10A splice variants differ in their 5' untranslated region and part of the 5'
coding region but are otherwise identical in the coding region when the various splice variants
are compared within each species. The human, mouse and rat PDE10A coding regions encode
779, 779 and 794 amino acids, respectively, corresponding to a protein of approximately 88.5 kDa.

EXAMPLE 10 - Distribution of PDE10A

In mouse, PDE10A mRNA was detected in testis and to a much lesser extent in brain but not
in heart, spleen, lung, liver, skeletal muscle, kidney, ovary, pancreas, smooth muscle, eye or
in total RNA isolated from 7, 11, 15 or 17 day old embryos (Soderling et al., 1999). These data
agree with the PDE10A mRNA pattern of distribution that we observed in wild-type and
pre-symptomatic HD mice. In mice, two different size transcripts are detected in northern
blots using the coding region as a probe. In mouse testis, the most abundant transcript is
approximately 4 kb. A 9.5 kb transcript was also detected in mouse testis. It appears that the
most abundant transcript in mouse brain is 9.5 kb. Similarly, two sizes of PDE10A transcript
were observed in rats; however, it appears that, in rat, the 4 kb mRNA is expressed
exclusively in testis while the 9.5 kb mRNA is expressed exclusively in brain (Fujishige et
al., 1999). Within the brain, the rat PDE10A mRNA was expressed in striatum and olfactory
tubercle and not cortex, cerebellum, hippocampus, midbrain or brainstem. In humans,
PDE10A is expressed in the caudate, putamen and testis. As was observed in rodents,
mRNAs of approximately 4 and 10 kb hybridized with the PDE10A probe. Again, it appears
that, although both sized transcripts are present in brain and testis, the larger mRNA is
predominant in the caudate and putamen and the smaller sized transcript is present in the
testis. Each of the mouse, rat and human PDE10A sequences is no longer than 4 kb and
spans the coding region and parts of the 3' UTR. The difference in abundance of the short and
long transcript in the testis and brain, respectively, in all three species suggests that the 3' UTR
functions to provide transcript stability in the brain. As such, we present the complete
sequence of the brain-specific transcript of PDE10A derived from mouse.

Modulating Activity of PDE10A Using cGMP-PDE Activity

Cyclic nucleotides are the predominant second messengers that activate cellular signalling
pathways (Beavo, 1995; Conti and Jin, 1999). The concentration of intracellular cyclic
nucleotides is dependent on their rate of synthesis by adenylyl and guanylyl cyclases, the rate of
efflux from the cell, and the rate of degradation. PDEs hydrolyze cAMP and cGMP, limiting
both the duration and amplitude of the cyclic nucleotide signal (Beavo, 1995; Conti and Jin,
1999). In mammals, PDEs are encoded by a large multigene family. The various PDE
family members have tissue-specific patterns of expression (Conti and Jin, 1999). PDEs have
also been described in Caenorhabditis, Drosophila, Dictyostelium, Saccharomyces, Candida
and Vibrio species, demonstrating that this enzyme has been conserved throughout evolution.
In mammals, PDEs are encoded by at least 10 gene families, each composed of one or more
genes. In addition, numerous splice variants of individual gene family members have been
described. These splice variants alter the 5' domain of the protein but share identical
nucleotide binding and catalytic domains. The catalytic domain, found in the
carboxy-terminus of the enzyme, is ~275 amino acids and highly conserved in amino acid
sequence in all PDEs. In total, it appears that there are ~50 PDEs expressed within the
mammalian body. Some PDEs are expressed in multiple tissues while others have a very
limited tissue-specific distribution (Conti and Jin, 1999).

PDE gene families differ with respect to their affinity for cAMP and cGMP and their
dependence on calcium and calmodulin (Beavo, 1995). Moreover, some PDEs are inhibited
or activated by binding cyclic nucleotides to a non-hydrolytic site. For example, PDE2A has
a lower Km for cGMP than cAMP although it hydrolyses both nucleotides. The binding of
cGMP to an allosteric activator site within PDE2 enhances the rate of catalysis of cAMP.
PDE2 is, therefore, a cGMP-stimulated cGMP and cAMP phosphodiesterase (Beavo, 1995).
Conversely, the affinity of PDE4 for cAMP is much greater than for cGMP and PDE4
activity is not affected by cGMP or calmodulin (Beavo, 1995). The differences in substrate
preference, modulation of activity and tissue-specific patterns of expression suggest that
subtle alterations in the relative levels of cAMP and cGMP mediated through the action of
various PDEs lead to a wide range of responses to extracellular signals.
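
The effect of differing substrate affinities can be illustrated with the Michaelis-Menten rate law, v = Vmax[S]/(Km + [S]). The following is a minimal sketch that assumes simple non-cooperative kinetics and purely hypothetical Km and Vmax values (none are taken from this document). Allosteric effects, such as the cGMP stimulation of PDE2 described above, are not modelled.

```python
def mm_rate(substrate_um: float, km_um: float, vmax: float = 1.0) -> float:
    """Michaelis-Menten hydrolysis rate for one substrate (same units as vmax)."""
    if substrate_um < 0 or km_um <= 0:
        raise ValueError("substrate must be >= 0 and Km must be > 0")
    return vmax * substrate_um / (km_um + substrate_um)

# Hypothetical enzyme: low Km for one nucleotide, high Km for the other.
KM_LOW_UM, KM_HIGH_UM = 1.0, 20.0
for s in (0.1, 1.0, 10.0):
    print(s, round(mm_rate(s, KM_LOW_UM), 3), round(mm_rate(s, KM_HIGH_UM), 3))
```

At substrate concentrations well below both Km values the low-Km substrate is hydrolysed much faster, which is one way small shifts in cAMP and cGMP levels could produce different responses.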

cGMP-PDE activity of compounds is measured using a one-step assay adapted from Wells et
al. (Wells, J. N., Baird, C. B., Wu, Y. J. and Hardman, J. G., Biochim. Biophys. Acta 384:430
(1975)) and adopted by Beavo et al., U.S. Patent No. 6,037,119. The reaction medium
contains 50 mM Tris-HCl, pH 7.5, 5 mM Mg-acetate, 250 µg/ml 5'-nucleotidase, 1 mM
EGTA and 0.15 µM 8-[3H]-cGMP. The enzyme used is a human recombinant PDE V
(ICOS, Seattle U.S.A.).

Compounds of interest are dissolved in DMSO, which is present at a final concentration of 2%
in the assay. The incubation time is 30 minutes, during which the total substrate conversion
does not exceed 30%.

The IC50 values for the compounds examined are determined from concentration-response
curves, typically using concentrations ranging from 10 nM to 10 µM. Tests against other
PDE enzymes using standard methodology also show the compounds to be highly selective for
the cGMP-specific PDE enzyme.

Rat aortic smooth muscle cells (RSMC) are prepared according to Chamley et al. in Cell
Tissue Res. 177:503-522 (1977) and used between the 10th and 25th passage at confluence in
24-well culture dishes. Culture media is aspirated and replaced with PBS (0.5 ml) containing
the compound tested at the appropriate concentration. After 30 minutes at 37°C, particulate
guanylate cyclase is stimulated by addition of ANF (100 nM) for 10 minutes. At the end of
incubation, the medium is withdrawn and two extractions are performed by addition of 65%
ethanol (0.25 ml). The two ethanolic extracts are pooled and evaporated to dryness using
a Speed-vac system. cGMP is measured after acetylation by scintillation proximity
immunoassay (AMERSHAM). The EC50 values are expressed as the dose giving half of the
stimulation at saturating concentrations.
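
Half-maximal values such as the IC50 and EC50 described above are read from a concentration-response curve. One simple approach is log-linear interpolation between the two tested concentrations that bracket the 50% response. The sketch below assumes a monotonically increasing response and uses hypothetical data points, not results from this document. The source does not state which curve-fitting method was used, and the function name is illustrative.

```python
import math

def ic50_log_interp(conc_nm, response_pct):
    """Estimate the half-maximal concentration (nM) by interpolating on log10(conc).

    conc_nm must be ascending; response_pct are the matching % responses.
    Returns None if 50% is not bracketed by the data.
    """
    points = list(zip(conc_nm, response_pct))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo <= 50.0 <= r_hi and r_hi > r_lo:
            frac = (50.0 - r_lo) / (r_hi - r_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None

# Hypothetical % inhibition values spanning the 10 nM to 10 µM range.
conc = [10, 100, 1000, 10000]
inhib = [5.0, 25.0, 75.0, 95.0]
print(ic50_log_interp(conc, inhib))
```

A full sigmoidal fit would use all of the data points rather than only the bracketing pair. Interpolation is shown here only because it needs no fitting library.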

EXAMPLE 12 - Selected Modulators of PDE10A Activity

The catalytic domain of PDE10A is most similar in amino acid sequence to PDE5A, PDE2A,
PDE6B and PDE6A. These members of the PDE family each contain a cGMP binding
sequence that is not observed in other PDE family members. The non-catalytic cGMP
binding sites (GAF domains) found in PDE2, 5 and 6 are also found in PDE10. At least for
PDE2, this site acts as an allosteric activator for cAMP hydrolytic activity. The GAF domain
of PDE10A binds other small molecules that act as allosteric activators. PDE10A is a cAMP
and cAMP-inhibited cGMP PDE (Fujishige et al., 1999; Fujishige et al., 1999; Loughney et
al., 1999; Soderling et al., 1999).

Attenuation of the production of cAMP may ameliorate the symptoms of HD and positively
affect gene expression. Pharmaceutically acceptable modulators of cAMP include quinpirole,
alloxan, miconazole nitrate, MDL-12330A, and tetracycline derivatives such as
demeclocycline and minocycline.



Compounds which are potent and selective modulators of cGMP-specific PDE, and are useful 
in a variety of therapeutic areas are taught by Daugan et al, U.S. patent No. 5,981,527, PCT 
publication No. WO 00/15639 to Icos Corporation and PCT publication No. WO 00/15228 to 
Icos Corporation, which are incorporated herein by reference. Such compounds include, for 
example: 

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-3-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione, and

(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2,3-dimethyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione.

PDE1B1 is expressed throughout the brain and is most abundant in the striatum, nucleus
accumbens and olfactory tubercle (Polli and Kincaid, 1994; Yan et al., 1994). PDE1B is a
cGMP, Ca/calmodulin-dependent PDE. Therefore, PDE1B and 10A are both expressed in
the majority, but not all, of striatal neurons, and it is likely that both genes are co-expressed in a
subset of striatal projection neurons. Selective inhibitors for PDE1 include KS-505, IC224,
and SCH 51866. Of these inhibitors, it appears that SCH 51866 has a ten-fold higher Km for
PDE1 than PDE10 (Soderling et al., 1999). The non-specific PDE inhibitor IBMX is a potent
inhibitor of PDE10A. Dipyridamole and SCH 51866 had the highest potency of inhibitors
tested on PDE10A activity. Dipyridamole was considered to be a PDE5- and PDE6-specific
inhibitor; however, the Km for dipyridamole is 10 times higher for PDE10A than the other
PDEs (Soderling et al., 1999). Selective inhibitors of PDE5, 2, 3 and 4 had much greater
IC50 values for PDE10 (Soderling et al., 1999).

EXAMPLE 13 - Clinical Use of PDE10A Modulator

A 38 year-old female was admitted to hospital from a long-term care facility due to
progressive deterioration of her physical and mental symptoms caused by Huntington's
disease. The patient had been diagnosed with Huntington's disease at age 26. Prior to
admission to the hospital, she had become increasingly aggressive and uncooperative.
Moreover, there appeared to be an increase in the number of psychotic episodes. SPECT
showed no abnormality of brain blood flow but MRI showed bilateral caudate atrophy as well
as global atrophy of the cerebrum and corpus callosum.

The patient had been stable for a number of years on the antipsychotic haloperidol (3 mg/day).
For the last two years, the haloperidol had been replaced by olanzapine (2.5-7.5 mg/day).

Minocycline, a tetracycline derivative, was administered at 50 mg twice daily for 7 days,
followed by 100 mg twice daily for 7 days and finally 200 mg twice daily for 5 weeks. After
5 weeks of 200 mg twice daily minocycline administration, there was a mild improvement
compared to the baseline clinical global assessment made at the time of admission. The
minocycline treatment was suspended for 7 days. Due to a significant increase in the number
of aggressive incidents and a decrease in cooperativity, minocycline (200 mg twice daily)
treatment was resumed. The patient responded within 3 days to the resumed minocycline
treatment with a return to mild improvement compared to the baseline clinical global
assessment made at the time of admission. Minocycline (200 mg twice daily) treatment will
continue indefinitely. The improvement in behaviour and decrease in apparent psychosis have
allowed for the transfer of the patient from the acute care facility back to long-term care.

While the present invention has been described in terms of specific embodiments, it is
understood that variations and modifications will occur to those skilled in the art.
Accordingly, only such limitations as appear in the appended claims should be placed on the
invention.

We claim:

1. A composition for treating a CAG repeat disorder comprising a compound which
modulates PDE10A expression and a pharmaceutically acceptable carrier.

2. A composition as claimed in claim 1, wherein said compound is selected from the group
consisting of: quinpirole, alloxan, miconazole nitrate, MDL-12330A, and tetracycline
derivatives such as demeclocycline.

3. A composition as claimed in claim 1, wherein said compound is selected from the group
consisting of:

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-3-methyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione,

(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2,3-dimethyl-pyrazino[2',1':6,1]pyrido[3,4-b]indole-1,4-dione.

4. A composition as claimed in claim 1, wherein said compound is selected from the group
consisting of: KS-505, IC224, SCH 51866, IBMX and Dipyridamole.

5. A composition as claimed in any one of claims 1 to 4, wherein said disorder is
Huntington's disease.



6. The use of a composition as claimed in any one of claims 1 to 5 for treating a CAG repeat 
disorder comprising administering said composition to a subject in need of such treatment. 

7. The use of a composition of claim 6 for treating Huntington's disease comprising 
administering said composition to a subject in need of such treatment. 

8. A method for identifying a compound which inhibits or promotes a CAG repeat disorder, 
comprising the steps of: 

(a) selecting a control animal having PDE10A and a test animal having PDE10A;

(b) treating said test animal using a compound; and,

(c) determining the relative quantity of RNA corresponding to PDE10A, as between said
animals. 

9. A method of claim 8, wherein said animal is a mammal. 

10. A method of claim 9, wherein said mammal is a mouse. 

11. A method of claim 10, wherein said mouse is an R6/2 transgenic mouse.

12. A method of any one of claims 8 to 11, wherein said CAG repeat disorder is
Huntington's disease.

13. A method for identifying a compound which inhibits or promotes a CAG repeat disorder,
comprising the steps of:

(a) selecting a host cell containing PDE10A;

(b) cloning said host cell and separating said clones into a test group and a control group;

(c) treating said test group using a compound; and

(d) determining the relative quantity of RNA corresponding to PDE10A, as between said test
group and said control group.

14. A method of claim 13, wherein said CAG repeat disorder is Huntington's disease. 

15. A method for detecting the presence of or the predisposition for a CAG repeat disorder, 
said method comprising determining the level of expression of RNA corresponding to 
PDE10A in an individual relative to a predetermined control level of expression, wherein a
decreased expression of said RNA as compared to said control is indicative of a CAG repeat 
disorder. 

16. A method of claim 15, wherein said CAG repeat disorder is Huntington's disease. 

17. A method of claim 15 or 16, wherein said expression is measured by in situ 
hybridization. 

18. A method of claim 15 or 16, wherein said expression is measured using a polymerase
chain reaction.

19. A method of claim 15 or 16, wherein said expression is measured using a DNA
fingerprinting technique.

Figure 3




5' 61 71 81 91 

c , ATGTTCA TTTACTGTACAAAAACCCAGTGCAGCTGATGATGCAAAGCAGT 
^ ^ TACAAGt TTyyiTGACATGTTTTTGGGTCACGTCGACTACTACGTTTCGTCA 

5' 11 21 31 41 

, CTCTCTCTGTGTACAGTGCCCCACCTATTTATUUUVTCACGTACTTGCCCA 
GAGAGAGAGACaLTGTCACGGGGTGGATAAATTTTTAGTGC ATGAAO^ 

5' 61 71 81 91 

GAACACTGTGAAAOVCTTAAOITAAGAAGW^CGCAGCXSTCTGGAT^ 
151 CTTGTGACAC TTTGTGAATT GTATTCTTGT TTGCGTCGCAGACCTAAGAA 

5 • 1 1 p ^Wl 21 31 41 

Tr PJX AGQAGAG pAQCTTTCT CCACAGGTACACAG TAACAA AAGAOhTPPfl 
" AGGTTCCTCT CGTCGAAAGAGGTGTCCTTGTGTCATTGTTTTCTCCAGGC 

5' 61 .71 81 91 

- CCGCCATCCACACCCAGCCAAGACACCTCAGAGGCCATAGGGACAACCTC 
GGCGGTAGGTGTGGGTCGGTTCTGTGGAGTCTCCGGTATCCCTGTTGGA6 

5' 11 21 31 41 

- - , CTTGCTGGCCAAaVCCTGCTGGAGCAGGGGCAa^GGTCCCAGCy^CTGAT 
" GAACGACCGGTTGTGGACGACCTCGTCCCCGTGTCOVGGGTCGTTGACTA 

5' 61 71 81 91 

_ CI CCTCAGTGGATGGGTCTGCAGCCAAAGCCTTAATGGGCTCTCTTTTGAAG 
GGAGTCACCTACCCAGACGT CGGTTTCGGAATTACCCGAG AGAAAACTTC 

5' 11 21 31 41 

-rtT GGGAAAGAAAGAATTTCAAGCTTATGATATCCAATATTATTATAGTTGAT 

* " CCCTTTCITTCiTAAAGTTCGAATACnATAGGTTATAATAATATCaACT^ 

5' 61 71 81 91 

GAGTTAGTAAATTCCAAAAAAAAAA 
* ^ CTCAATCATTTAAGGTTTTTTTTTT 
Figure 8

[Southern blot lane labels: BamHI digests W1, W2, H1, H2; EcoRI digests W1, W2, H1, H2; size marker 0.5]

Figure 9

Figure 10

5* 11 21 31 41 

^ GTGACTTCGACCAGGTGOVGATATTTGTCCACreTGACaSACGTC^ 

S« 61 71 81 . 91 

AGCCATTCGATCCACy^CAAATTGATCTTCTATaVTCTTGGAATCTGAATT 
^ TCGGTAAGCTAGGTGTGTTT AACTAGAAGATAGTAGAACC TTAGACTTAA 

5< 11 21 31 41 

GCAGGGAGGAGCy^GTATGTAAGACGACCGTTTAATTCAGGCATTCCGAAG 
101 cGTCCCTCCT CGTCATACATTCTGCTGGCAAATTAAGTCCGTAAGGCTTC 

5« 61 71 81 91 

- GCATGAGCGCATGGATTCTG TCACCAAGCGTATAAAAGGACCCTGGCATT 
^ CGTACTCGCGTACCTAAGACAGTGGTTCGCATATTTTCCTGGGACCGTAA 

5* 11 21 31 . 41 

GGGAAACCTATGACX3GACrroTTTTTGCIX3TAGAAGTAGGGATT^ 
201 CCCTTTG^ATACTGCCTGACAAATACXSACATCTT^TCCCTAAAATGTCT 

5' 61 71 81 91 

o c 1 AGTCTCCTTGAATTTGCCCTGCCTGGGGCAGTTTTGCA6AGGAACCTGCC 
251 TCAGAGGAACTTAAACGGGACGGACCCCGTa^AAAOSTCrCCrrGGACGG 

5' 11 21 31 41 

AGAGATTTATTGGCTGGTCAGTCTCTTGTGAAATAGTATCATGTGAGAAA 
301 TCTCTAAATAACCGACCAGTaiGAGAACy^CTTTATa^TAGTACU^CTCI^ 

5« 61 71 81 91 

CAGTTTGTAGAAAAAAACTATACCTGGGAAGACCTTTGGAACATTGTTCC 
351 GTCAAACATCTTTTTTTGATATGGACCCTTCTGGAAAOSTTGTAACAAGG 

5' 11 21 31 41 

TTCCATGGGCCAAGACTCAGTTAGGAGGCATAAATCTGCCCGGAATAAAC 
■* ° AAGGTACCCGGTTCTGAGTCAATCCTCCGTATTTAGACGGGCCTTATTTG 

5' 61 71 81 91 

TAGGCCAGGATAa\GCCATGTTTAGTTAATAATTTGGTTTTAGAATT<^ 
* ^ ATCCGGTCCTATGTCGGTAC AAATCAATTATTAAACCAAAATCTTAAGTG 

5' 11 21 31 41 

ACAGGCAGGATTGGTTTTTTTGTGTCTTGG CAAGTGGAGC ATATTT7ACA 
501 TGTCCGTCCTAACO^AAAAAAO^CAGAACCGTTaiCCTCGTATAAATTGT 

5' 61 71 81 . 91 

- r TACAGGCATG GGAATCCTGC CTCTTAGCTT TTCCCACCCT CTTGTCTCAC 
551 ATGTCOSTACCCTTAGGACGGAGAATCGAAAAGGGTGGGAGAACAQAGTG 

5' 11 21 31 41 

a^GTTTTTTCTCTCCAAAGGTTTCCAGGTLATTTCTCATTAATGGC^^ 
601 GTTCAAAAAAGAGAGGTTTCCyU\AGGTCCrTAAAGAGTAATTACCGACTA 
Figure 10 continued



5« 61 71 81 91 

- _ , GCAAACTTAG TGAATAATAATGAATATAAACAATGCTCAC CTCACCAAAA 

^^^CGTTTGAATCACTTATTATTACTTATATTTGTTACGAGTGGAGTGGTTTT 

5« 11 21 31 41 

TTATATTATTTGOVGTCaiTTTGTGATAACyVCaU^TTTTATCGCAATGGTT 
701 ;^TATAATAAACX3TCAGTAAACACrATTGTGTTTA7^TAGCGTTACCAA 

5« 61 71 81 91 

ATTATTTAATTTGTGGCCACACa^CTGTGGTTATCTTTTGTTGTGGTTC 
"^^^ TAATAAATTAAACACOSGTGTGTGACy^CCy^TAGAT^CAACACa^C^ 

5' 11 21 31 41 

TCTGAGAAAATGTTGTTGGATATGTAAGTG CCAATACCAGTGTGAAGTAT 
^ ° ^ AGACTCriTTT ACUlAGAACCr ATACATTCACGGTTATGGTC ACACTT 

51 61 71 81 . 91 

TGATCCCGGG CAGCAAAATACAGCCTAAGGTTTGTAAACATCAATTCTAT 
® ^ ^ ACTAGGGCCCGTCGTTTTATGTCGGATTCCAAACATTTGT AGTTAAGATA 

5 1 11 21 31 41 

CTCAGTTCATCAGAGGGCCTGAGAAGCTGCGGGGCAGTGTAAAGTAAAGT 
^ ° ^ GAGTCAAGTAGTCTCCCGGACTCTTCGACGCCCCGTCACATTTCATTTCA 

5. 61 71 81 91 

ATGCTGGGCT GGTGGTGGTC AGCCTCCCGC CTGAAGAGTG ACCAGTGCTG 
^^•"-TACmCCCGACa^CCACCaiGTCGGAGGGCGGAClTXJTCACTGGTCAaaAC 

S.« 11 21 31 41 

GCCCGACGGATCGCTGAGATATTCTCCCATAATGGCAAAAAAATAGGCAG 
^ ° ° ^ CGGGCTGCCTAGCGACTCTATAAGAGGGTATTACCGTTTTTTTATCCGTC 

51 61 71 81 91 

TTTGATGTGACCTGTTTAGTGTGGCTCTCCTCTTTTGAGCATGTGTTAGC 
AAACTAO^CrGGACAAATCACACCXSAGAGGAGAAAACTCXSTAa^C^ 

5« 11 21 31 41 

ATTTTTATTTTATACTCATCCAGTGAACTCTGCTCTTCCAAGTGTGTTCA 
1101 TAAAAATAAAATATGAGTAGGTCACTTGAGACGAGAAGGTTCACACAAGT 

5. 61 71 81 91 

TGTATGTGCT AGATATATTAGCACAGCCTG CCTTCTGCTG CACAACGCCT 
1151 ;^c;^TACACGATCTATATAATCGTGTCGGACQGAAGACGACGTGTTGCGGA 

5> 11 21 31 41 

TAGAGACCCGGCCTTTCAATGAGCTTAGCTTGTGCTCTGTTTCTGCTCTC 
ATCTCTGGGCCGGAAAGTTACTCGAATCGAACACGAGACAAAGACGAGAG 

51 61 71 81 91 

TTAGGTCTAAACTATGGTGTCAGTTTTAATAGAACAAAAGTATGCATCTT 
•'•^^•'^ AATCCAGATTTGATACCACAGTCAAAATTATCTTGTTTTCATAC^ 
Figure 10 continued



5' 11 21 . 31 41 

1301 <3CCTTGGCTTGAGCCTTTTCGTTTTClAATGCPGACTTe^ 

CGGAACCGAACTCGGA/^GOU^GTTACGACTGAAGAGGGGAi'.AGAGA 

5' 61 71 81 91 

^35T CCTGTGCTCACCTTACCTTTCCAGAGTGTAAGGGACAACTTTTAAGQAGG 
GGACACGAGTGGAATGGA/UIGGTCTCACATTCCCTGTTGAAAATTCCTCC 

5' 11 21 31 41 

1401 CGTGTCCCTGGTAGGGGCATGGCTGTTCACCAGGTGCCTGTCATCACCCC 
GCACAGGGACCATCCCCGTAGGGACAAGTGGTCCACGGACAGTAGTGGGG 

5' 61 71 81 91 

1451 ACTTGACTGACATCTACCCTGGTGACTATG GGTTCCTCTTGTTTGTAGGG 
TGAACTGACTGTAGATGGGACCACTGATAC CCAAGGAGAACAAACATCCC 

5' 11 21 31 41 

1501 ^CXSGTGGCTCCS^GGTGGAGGCATaVATCTGTTGGGTTCTGGTTCCC^^ 
TTGCCACCGAGGTCCACCTCCGTAGTTAGACAACCCAAGACCAAGGGCCG 

S' . 61 71 81 91 

ISSl^^^^'^^^^'^'^^'^^^^'^QTCTCTTCTCTGTATATTCCT 

Aa»3AAACCaLAAACTTTCAGA6AAGAGACATATAAGQATGGGACGTAAAC 

5' 11 21-31 41 

, g Q T CTTTGTGTGG TGCTGATGCTGTGCGCAGTAGGATTCTTGaATQACTCTCC 
GAAACACACCACGACTACGACACGCGTCATCCTAAGAACCTACTGAGAGG 

5" 61 71 81 91 

1651 ATCAGTCACAGACTCCCCCTGTTGCAAAGTGTCAGGCTGACTCGACAGTC 
TAGTCAGTGTCTGAGGGGGACAACGTTTCACAGTCCGACTGAGCTGTCAG 

5' 11 21 31 41 

. „ Q T ACCGTAAAAT CTGAGTCAGT CACACACAGG CTGTCAGCCACGGCTTCCAC 
TGGCATTTTAGACTCAGTCAGTGTGTGTCCGACAGTCGGTGCCGAAGGTG 

5' 61 71 81 91 

, 7 5 T TTGCATGGCT ATTCTATTTT CACACGTGAGTTTCTGTTGC TGGCTGGCTG 
■^AACGTACCGATAAGATAAAAGTGTGCACTCAAAGACAACXSACCGACCGAC 

5' 11 21 31 41 

looi^CTGGCATTATCTATGCTAAGTTGAAATCAGGAGTGCCCAGCAGAGCCCA 
TGACCGTAATAGATACGATTCAACTTTAGTCCTCACGGGTCGTCTCGGGT 

5' 61 71 81 91 

1851 TCATTCTCAC TGTCTTTGAAACAAAGCTGT ACGGTTTGAT CGATGAACGT 
AGTAAGAGTGAOlGAAACTTTGTTTCGAakTGCa^AACTAGCTACTTGCa^ 

5' 11 21 31 41 

, - ATTTAAAGCATTTCATGCAATGACAAAGTG CTCAGTAGTGGAAGGCAGGC 
TAAATTTCGTAAAGTACGTTACTGTTTCACGAGTCATCACCTTCCGTCCG 
Figure 10 continued



5' 61 71 81 91 

o _ - TGTGACa\GTCTGCCTGCTCCTTACTATAATTGTGAGGATTTGTTACTGG 
■'^^^■'■ACACTGGTCAGACGGACGAGGJ^ATGATATTAACACTCCTAAACAATGACC 

5' 11 21 31 41 

AACAGTACATGGAGGCCTGACCTTGTGGGGGCACAGGGTGGAACCTTAGC 
2001 TTGTCATGTACCTCCX3GACTGGAACACCCCCGTGTCCCAC CTTGGAATCG 

.5' 61 71 81 91 . 

TGAATATAGTGTGTGTCTCAAGAGGAAGTCAGGGTACTAGCTCAGTGCTC 
20=lA(nTATATCACy^CACAGAGTTCTCCTTCAGTCCCATGATCGAGTCACGAG 

S' 11 21 31 41 

^, AATCTCCAGGTACTATATATACa^TTTGCCCGTTTTATCTCTAATGTGAAA 
2101 TTAGAGGTCC ATGATATATATGTAAACX3GG CAAAATAGAG ATTACACTTT 

5' 61 71 81 91 

TAAATCCCCAAACACTTGTTTATCGTGTAG CGTACCTAAAAGACTATTCT 
2151 ATTTAGGGGTTTGTGAACaULATAGCACATCGCATGaATTTT 

SI 11 21 31 41 

ATTATGGGTG TCCCCACTTT CTTGGTTTGG TCACCCCGAT CCCCCGGTCT 
2201 TAATACCCAC AGGGGTGAAAGAACCAAACC AGTGGGGCTAGGGGGCGAGA 

5< 61 71 81 91 

^ c TCTGCTGTATCTAGAACA6TGACTATAAATGATGTATGGGAATAGTGTTT 
^■^^^ AGACGACATAGATCTTGTCACTGATATTTACTACATACCCTTATCACA7A 

5« 11 21 31 41 

- - ^ . CCATATGATC TGTTGTCTGG AGTATATGCT ACATGTTCAATTACTGTACA 
^■^ ° GGTATACTAGACAACAGACCTCATATACGATGTACAAGTTAATGACATGT 

5' 61 71 81 91 

^ _ c , AAAACCCAGTGCAGCTGATGATGCAAAGCAGTCTCTCTCTGTGTACAGTG 
2351 TXTTGGGTCA CGTCGACTAC TACX3TTTCGT CAGAGAGAGACACATGTCAC 

51 11 21 31 41 

. - CCCCACCTATTTAAAAATCACGTACAASCCCAGAACACTGTGAAACACTT 
^^"•'"GGGGTGGATAAATTTTTAGTGCATGTTSGGGTCTTGTGACACTTTGTGAA 

5- 61 71 81 91 

AACATAAGAACAAACGCy^GC GTCTGGATTC TTTCCAAGGAGAGCAGCTTT 
2451 rj.TGTATTCTTGTTTGCX3TCG CAGACCTAAG AAAGGTTCCTCTCGTCGA7UV 

5" 11 21 31 41 

- _ - , CTCCACAGGAACACAGTAAC AAAAGAGGTC CGCCGCCATC CACACCCAGC 
GAGGTGTCCTTGTGTCATTG TTTTCTCCAGGCGGCGGTAGGTGTGGGTCG 

5" 61 71 81 91 

_ c c T CAAGACACCTCAGAGGCCATAGGGACAACCTCCrrGCIXSGCaU^ 
2551 GXTCTGTGGAGTCTCCGGTATCCerGTTGGAGGAACXSACCGGTTQTGaAC 
Figure 10 continued



5' 11 . 21 31 41 

CTGGAGCAGG GGCACAGGTC CCAGCAACTGATCCTCAGTG GATGGGTCCG 
" " GACCTCGTCC CCGTGTCCAG GGTCGTTGAC TAGGAGTCAC CTACCCAGGC 

5' 61 71 81 91 

a\GTCy^GCCTTAATGGGCTCTCTTTTGAAGGGGAAAGL?U\AGAATTTCA 
^ oax GTCAGTTTCGGAATTACCCGAGAGAAAACTTCCCCTTTCTTTCTTAAAGT 

5' . 11- 21 31 41 

5 _ - - AGCTTATGATATCCAACATT ATTATAGTTG ATGAGTTAGT AAATTCCAAA 
^ '"-^ TCGAATACTATAGGTTGTAATAATATCAACTACTCAATCATTTAAGGTTT 

5' 61 71 81 91 

c AAAAAAAGATGATTTTATATGTATGACATAAAAAAAATCTTTGTAAAGTG 
TTTTTTTCTACrAAAATATACATACroTAT l Trrri TA GAJ^ 

5* 11 21 31 41 

CGCAAGTGCAATAATTTAAAGAGGTCTTATCTTTGCATTTATAAATTATA 
^ *» "-^ GCGTTCACGTTATTAAATTTCTCCAGAATAGAAACGTAAATATTTAATAT 

5' 61 71 81 91 

2 Q c 1 AATATTGTAC ATGTGTGTAATTTTTCy^TGTATTCyiTTTGCAGTCTTTGTA 
TTATAACATG TACACAGATTAAAAAGTACATAAGTAAACGTCAGT^CAT 

5' 11 21 31 41

2901 TTTAAAAAAACTTTACTGTTATGTTTGTATAATAGAACATTAATC^ 



5' 61 71 81 91 

2 g c 3 TTATAACTCAGACy^GGTGTAAATAAATTC ATAATTCAAACAGCCAGTAT 

AATATTGAGT CTGTTCCACATTTATTTAAGTATTAAGTTTGTCX3GTCATA 

5' 11 21 31 41 

3 Q Q ATATGCATATATGGGTGTTA CATTGCAAAAATCTCTATCTTTGTTCTATT 

"•^TATACGTATATACCCACAATGTAACGTTTTTAGAGATAGAAACAAGATAA 

5' 61 71 81 91

3 „ CACATGCTTAAAGAAGTAAG AAATCTTTTG TGGATATGTAATTATACATA 
J uoa. GTGTACGAATTTCTTCATTCTTTAGAAAACACCTATACATTAATAT6TAT 

5' 11 21 31 41

- . Q- TAAAGTATATATATATGTATGATACATGAAATATATTTAGAAATGTTCAT 
■'■^"•^ATTTCATATATATATACATACTATGTACTTTATATAAATCTTTACAAGTA 

5' 61 71 81 91

3 ISl -^TTTTAATGGATATTCTTTGGTGTGAATAATTGAATACAACATTTTTAA 
TTAAAATTAC CTATAAGAAACCACACTTAT TAACTTATGT TGTAAAAATT 

5' 11 21 31 41 

3201 AATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
TTACTTTTTTTTTTTTTTlTl"r iri " i TTTTTTTTTl 
Figure 12



5' 11 21 31 41

, AAGTGTAAATAAAATAAACATCTAATAAAAAAAATTACATACCATAGAGG 
TTCACATTTATTTTATTTGTAGATTATTTTTTTTAATGTAT^ 

5' 61 71 81 91 

AACAAGATAATTTCTGCCCAACTTCATACCCTCCAGCGTATAGTGTTGAG 
5 1 TTGTTCTATTAAAGACGGGTTGAAGTATGGGAGGTCGCATATCACAACTC 

5' 11 21 31 41

GTTTGGTCTGTTGCTGTGTATTGTAATGTAATGTTAAATTCTCTACCTGA 
^ ° CAAACCAGAC AACGACACATAACATTACTITTACAATTTAAGAGATGGACT 

5' 61 71 81 91 

r AGGTCTAGGC CTACAAGTGAATTCTCATGTTTATAGAGTTTTGTTGTGCA 
^ ^ ^ TCCAGATCCGGATGTTCACT TAAGAGTACAAATATCTCAAAACAACACGT 

5' 11 21 31 41

AACCTTGTTCCTTAATTTAARACTATGGTTAAAAAACATULACAARACriXM 
201 TTGGAAOyVGGAATTAAATTTTGATACCAATTTTTTGTTTTGTl^^ 

5' 61 71 81 91

^, CTACAGCCAATAACTGAAGGGGGTTACCTTGTTGAAGGGGTGGAAAAGAG 
GATGTOMTTATTGACTTCCCCCAATGGAACAACTTCCCCACCTTTTCTC 

5' 11 21 31 41

^ AGAGGAGGAAGAAGGGAGTTCAAGAGAAGGAGAAGAACAAGAGGAQAGGA 
301 TCTCCTCCTT CTTCCCTCAAGTTCTCTTCC TCTTCTTGTT CTCCTCTCCT 

5' 61 71 81 91 

GGAAGCTGCCACGAGGGGAGATGGGCCATGAGAACTTGGCCAGGAGAAAT 
351 cCTTCGACGGTGCTCCCCTCTACCCGGTACTCTTGAACCGGTCCTCTTTA 

5' 11 21 31 41 

. AGCCAGTATGTGGAGTACACCACTGAGGAGGTAGCCAGGCTAGCAGTTAG 
'* 0 1 TCGGTCATAG ACCTCATCTGGTGACTCCTCCATCGGTCCGATCX3TCAATC 

5' 61 71 81 91 

^ AAGAGTAGATTAGGGGTTATTTTTCCCCCACTCCACATAGTTATCAAAGC 
■^^iTTCTCa^TCTAATCCCCaiATAAAT^GGGGGTGAGGTGTATCAATAGTTTCG 

5' 11 21 31 41 

CAAATAAAATAACCATAGTCTGAGTCTCATCTATTTGTAAGCTAGTTGGG 
501 GTTTATTTTATTGGTATCAGACTCa^GAGTAGATAAACATT CX3ATCAACCC 

5' 61 71 81 91

_ TATAAGATTAATTTGGCTGTACTACAGTTTAGATTTCTAACATAGGAACT 
ATATTCTAAT TAAACCGACATGATGTCAAATCTAAAGATTGTATCCTTGA 

5' 11 21 31 41 

ATCAAAAACTTGCTCAAACAAGAACATGCTGACAATATTTTAAAATGATT 
^ " ^ TAGTTTTTGAAOSAGTTTGTTCrrGTAaaACTGTTATA^ 
Figure 12 continued



5' 61 71 81 91

ATTTATATTGTTTGCACTTTCTAAAGTTTCTTCTAAATGTTCCATGGTCA 
" ^ TAAATATAAC AAACGTGAAAGATTTOUAGAAGATTTACAAGGTACCAGT 

5' 11 21 31 41 

AATTAAAA/UITATACATATTGGCTATTAT^TTCGTCTAAGTGGGGCTGQA 
' " ^ TTAATTTTTTATATGTATAACOSATAATTTAAGCAGATTCACCCaSACCT 

5' 61 71 81 91 

„ c 1 GAGATAGCTCAGAGGTTAAG AGCACTGACTGCTCTTCCAGAGGTCCTGAG 
' = ^ CTCTATCGAG tCTCCAATTC TCGTGACTGACGAGAAGGTCTCCAGGACTC 

5' 11 21 31 41 

o n 1 TTOlATTCCCAGaSACCACATGGTGGCTCACAGCCy^TCTGTAATAGATAG 
" AAGTTAAGGG TCGCTGGTGT ACGACCGAGTGTCGGTAGAC ATTATCTATC 

5' 61 71 81 91

- c , GATCTGACGC CCTCTTCTGG AGTGTCTGAAGACAGCTACAATGTACTCAT 
^^••■CTAGACTGCGGGAGAAGACCTCACAGACTTCTGTCGATGTTACATGAGTA 

5' 11 21 31 41 

Oft, ATATATTA7VA TAAATAATAT TAGAAAATTC TTCTAAGT6T ATCATTTATA 
S 01 TATATAATTTATTTATTATAATCTTTTAAGAAGATTCACATAGTAAATAT 

5' 61 71 81 91

oai GAATATTTAATATATAAAGT AAATGCCTCAGGAAATATAAACTTGGAATT 
352. CTTATAAATT ATATATTTCATTTACGGAGT CCTTTATATTTGAACCTTAA 

5' 11 21 31 41 

- -rt- AAATCAAAGAACTTCATGAGTAGTGGGCCACAATU^AATGTGTACCAGGGG 
1001 tttagtttcttgaagtactc ATCACCCGGT GTTTTTTACACATGGTCCCC 

5' 61 71 81 91

. AAGACCGGAGGGAGGGGAGAAGGAAGGGATGGAGATAGAATTTTGCCTCT 
1051 TTCTGGCCTC CCTCCCCTCT TCCITCCCTACCTCEATCTT AAAACX3GAGA 

5' 11 21 31 41 

- - GCATTCCTTGGGCTGGCACAGGTATAATGCTGTGGGAATTGGGAT^CTAC 

CGTAAGGTACCaSACCGTGTCCATATTACraACACCCrTAACCCT^ 

5' 61 71 81 91 

. - CI AAGGAAGCTGCAAAGCTGGGCGGAACTCGTTTCCGCAAGCTGGGCTCATC 
115 1 TTCCTTCGACGTTTCGACCCGCCTTGAGCAAAGGCGTTC6ACCCGAGTAG 

5' 11 21 31 41 

TAAGTGTCCATGCATGGCTGCCACACTGCAGTGAACTTTAAAACATTTGT 
■'•'^ ATTCACAGGTACGTACaSACGGTGTGACGTCACTTGZUU^TT^ 

5' 61 71 81 91

. , c -1 GTTCCAGAGATGTAGAGATG CTCACAATAG TACAAAGGCG GGAGGGAGGT 
•^"^ * CAAGGTCrCTACATCTCTACGAGTGTTATC ATGTTTCCGC CCTCCCTCCA 
Figure 12 continued



5' 11 21 31 41 

, 3 Q, ATTTCCAGACTAAGAGQAAGAAAAACCATTGCTSATTAAACATCTGCATA 
TA?IAGGTCTGATTCTCCTTCTTTTTGGTAACGACTAATTTGTAGACX3TAT 

5' 61 71 81 91 

13 S 1 TGAGCGCCCCCACCTCCyiTACACACACACACTVCACACACACACACACAC^ 
^-^ ACTCGCGGGGGTGGAGGTATGTGTGTGTGTGTGTGTGTGTGTGTG'i'GTGT 

5' 11 21 31 41 

. ^ _ - CAACCAAACAGAACAAATACACATGCATGT CTACAGCCTGCAGGAACAAA 
GTTGGTTTGTCTTGTTTATGTGTAQGTACAGATGTCGGAC6TCCTTGTTT 

5' 61 71 81 91

, . 51 ATGGTATGTCTGTGAGGAACCAGGAGATGCACAGGTCCTAACCTCTGTCT 
^ TACCATACAGAO^CTCCTTGGTCCTCTAaSTGTCCAGGATT^ 

5' 11 21 31 41 

CCTAO^GCCCTGAAGTCTGGTCaGGGTCAAATGTACAAAAGCAGGCrAA 
"^^ ""^ GGAtGTTCXSGGACnra^GACCAGTCCa^GTTTACATGTTTTCGTCGGATO 

5' 61 71 81 91

- ccn GGAAGCTGTTTAGTGAAAGATTTTTTTCTTCa^CrCTAGGAAa\ACCT 
■^^^"^ CCTTCQAOlAATCACiTTCTAAAAAAAGTVAGTTGAGATCCTTGTTGGA 

5' 11 21 31 41

T^ni TTCCTAGGATTTGQAGAGTGCTCAGGAGGAAACyiTTCAGACAACTGATGC 
"-^ AAGGATCCTAAACCTCTCACGAGTCCTCCTTTGTAAQTCTGTTGACTAC^ 

5' 61 71 81 91

- g-. TCTCTGTGTACCCCa.GATTCAGGTATTGGGGTAGTTAGTTGTGCTCaiTGT 

^"^ AGAGACACATGGGGTCTAAGTCCATAACCCCATCAATCaACACGAGTACA 

5' 11 21 31 41 

ATGTGCTAGATATATTAGCACAGCCTGCCTTCTGCTGCACAACGCCTTAG • 
X / ux TACa^CGATCTATATAATaSTGTCGGAOSGAAGACGACGTGTTGCXSGAATC 

5' 61 71 81 91

. AGACCCXSGCCTTTCAATGAGCTTAGCTTGTGCTCTGTTTCTGCTCTCTTA 
X /»i TCTGGGCCGGAAAGTTACTCGAATCGAACACGAGACAAAGACGAGAGAAT 

5' 11 21 31 41 

, GGTCT/y^ACTATGGTGTCAGTTTTAATAGAACAAAAGTATGCATCTTGCC 
" CCAGATTTGA TACCACAGTC AAAATTATCT TGTTTTCATACGTAGAACGG 

5' 61 71 81 91 

T pc- TTGGCTTGAGCCTTTTCGTTTTCAATGCTGACTTCTCCCCTTTCTCTCCT 
"^''^■^ AACCGAACTCGGA/UiAGCSUUUlGTTACXSACTGAAGAGGOGAAA^ 

5' 11 21 31 41 

-IQQ-. GTGCTCACCTTACCTTTCCy^GAGTGTAAGGGACa^CTTTTAAGGAGGCGT 
CACGAGTGGAATGGAAAGGTCTCACATTCCCTGTTGAAAATTCCTCCSGCA 
Figure 12 continued



5' 61 71 81 91

, Qsi GTCCCTGGTAGGGGCATCCCTGTTCACCAGGTGCCTGTCATCACCCCACT 
^ CAGGGACCATCCCCGTAGGG ACAAGTGGTCCACGGACAGTAGTGGGGTOA 

5' 11 21 31 41

^ TGACTGACATCTACCCTGGTGACTATGGGTTCCTCTTGTTTGTAGGGAAC 
^ " "-^ ACTGACrGTAGATGGGACCACrGATACCCAAGGAGAAaUU^CAT(X:CTTG 

5' 61 71 81 91 

- - c 1 GGTGGCTCCAGGTGGAGGCATCAATCTGTTGGGTTCTGGTTCCCGGCTGC 
" ^ CCACa3AGGTCCACCTCCGTAGTTAGAa^CCCAAGACCAAGGGCCX3ACG 

5' 11 21 31 41 

CTTTGGTTTTGAAAGTCTCTTCTCTGTATATTCCTACCCTGOITTT^ 
" GAAACCS^AAACTTTCAGAGAAGAGACATATAAGGATGGGACGTAAACGAA 

5' 61 71 81 91

^. ci TGTGTGGTGCTGATGCTGTGCraCAGCAGGATTCTTGGATGACTCTCa^ 
^•^^•^ACACACCACGACTA.a3ACACGCGTC6TCCTAAGAACCTACTGAGAGGTAG 

5' 11 21 31 41

AGTCACAGACTCCCCCTGTTGCAAAGTGTCAGGCTGACTCGACAGTCACC 
u X T(:AGTGTCTGAGGGGGACAAC:X3TTTCACa.GTCCGACimGC^ 

5' 61 71 81 91 

GTAAAATCTGAGTCAGTCACACaVCAGGCTGTCAGCCACGGCTTCCT^CTTG 
Ca^TTTTAGACTCyVGTCAGTGTGTGTCCGAC AGTCGGTGCCGAAGGTGAAC 

5' 11 21 31 41

CATGGCTATTCTATTTTCACACGTGAGTTTCTGTTGCTGGCTGGCTGACT 
^ J ux GTAC(:^TAAGATAAAAGTGTGCACTCAAAGACAACGACCGACCGACTGA 

5' 61 71 81 91 

^ - P 1 GGCATTATCTATGCTAAGTTGAAATCAGGGGTGCCCAGCAGAGCCCATCA 
^ ^ CCX3TAATAGATACGATTCAACTTTAGTCCC CACX3GGTCGT CTCX3GGTAGT 

5' 11 21 31 41

lA.ni TTCTCACTGTCTTTGAAACAAAGCTGTACGGTTTGATCGATGAACGTATT 
* " AAGAGTGACAGAAACTTTGT TTCGACATGC CAAACTAGCT ACTTGCATAA 

5' 61 71 81 91 

, . ^ 1 TAAAGCATTTCATGCAATGACAAAGTGCTCAGTAGTGGAAGGCAGGCTGT 
^ « b X ATTTCGTAAAGTACGTTACTOTTTCACGAG TCATCACCTT CCGTCCX3ACA 

5' 11 21 31 41 

, c A 1 GACCyiGTCTGCCnXSCTCCTTACTATAATTGTGAGGATTTGTTACTGGAAC 
^^"•^CTGGTCAGACGGACGAGGAATGATATTAACACTCCTAAACAATGACCTTG 

5' 61 71 81 91 

AGTACATGGAGGCCTGACCTTGTGGGGGCACyiGGGTGGAACCrTAGCTGA 
^ ^ TCATGTACCTCCX3GACTGGAACACCCCCGT GTCCCACCTTGGAATCX3ACT 
Figure 12 continued

5' 11 21 31 41



2601 ATATAGTGTG TGTCTCAAGAGGAAGTCAGG GTACTAGCT.C AGTCCTCAAT 
TATATaiCACAa^GAGTTCTarrTCAGT(X;CATGATCX3AGTCA 

5' 61 71 81 91 

2651 CTCCAGGTACTATATATACATTTGCCCGTTTTATCTCTAATGTGAAATAA 
GAGGTCCATGATATATATGTAAACGGGCA7VAATAGAGATTACACTTTATT 

5' 11 21 31 41

2701^'^^^^^^^'^^'^^^°TTTATC«TGTAGCGTACCTAAAA(3ACTATTCTATT 
TAGGGGTTTG TGAACAAATAGCACATCGCATGGATTTTCTGATAAGATAA 

5' 61 71 81 91 

275, ATGGGTGTCC CCACTTTCTTGGTTTGGTCA CCCCGATCCC CCGGTCTTCT 
TACCCACAGG GGTGAAAGAACCAAACCAGTGGGOCTAGGGGGCCAGAAGA 

5' 11 21 31 41

2305 GCTGTATCTAGAACAGTGACTATAAATGATGTATGGGAATAGTGTTTCCA 
CXSACATAGATCPTOTCyiCTGATATTTACrrAa^ 

5' 61 71 81 91 

2 a s 1 TATGATCTGT TGTCTGGAGT ATATGCTACATGTTO^TTTACTGTACAAAA 
ATACTAGACTUlCAGACCTCATATACXSATGTAa^GTAAATGAavroT^ 

5' 11 21 31 41 

2Qoi-^CCCa^GTGCAGCTGATGATGa^GCyVGTCTCTCTCTGTGTACAGTGC^ 
TGGGTCACGTCGACTACTACGTTTCXSTCAGAGAGAGACACATGTCACGGG 

5' 61 71 81 91 

2 9 S 1 C^CCTATTTAAA/yiTCACGTACTTGCCCAGAACACTGTGAAACACTTAAC 
GTGGATAAATTTTTAGTGCa.TGAACGGGTCTTGTGAGACTTTGTGAATTG 

5' 11 21 31 41

3001 ATAAGAACy^AACJGCAGOSTCTGGATTCTTTCCAAGGAGAGCAGCTTTCT 
J u w X TATTCTTGTTTGCGTCGCS^GACCrAAGATU^GGTTCCrCTCGTCGT^GAG 

5' 61 71 81 91

3 OSl P^<^GGAACyVCAGTAA(:auyiAGAGGTCCGCCGCCATCCa^CACCCAGCCa^ 
* GTGTCCTTGTGTCATTGTTTTCTCCAGGCGGCGGTAGGTGTGGGTCGGTT 

5' 11 21 31 41

3101 GACACCTCAGAGGCCy^TAGGGAaUVCCTCCTTGCTGOCCAACACCrGCTG 
CTGTGGAGTCTCCGGTATCCCTGTTGGAGGAACGACCGGTTGTGGACGAC 

5' 61 71 81 91

3151 GAGCAGGGGCACAGGTCCCAGCAACTGATCCTCAGTGGATGGGTCTGCAG 
■^CTCGTCCCCGTGTCO^GGGTCGTTGACTAGGAGTCACCTACCCAGACGTC 

5' 11 21 31 41

3201 CCAAAGCCTTAATGGGCTCT CTTTTGAAGG GGAAAGAAAG AATTTCAAGC 
GGTTTCX3GAATTACCCGAGAGAA7UICTTCCCCTTTCITTCTTAAAGTTCG 
Figure 12 continued



5' 61 71 81 91

325lJjTASATAGGTTATAATAATATCy^CTACTCAATaiTTTJA^ 

5' 11 21 31 41

AAAAGATGATTTTATATGTATGACATAAAAAAAATCTTTGTAAAGTGCGC 
3301™:5CTACTAAAATATACATACTGTATTTTTTTTAGAAACAm 

5' 61 71 81 91

AAGTG(^TAATTTAAAGAGGTCTTATCTTTGCATTTATA^ 
3351 ^5^cGlTATTAAATTTCTCCAGAATAGAAACX5TAAATATTTAATATTTA 

5' 11 21 31 41

ATTGTACATGTGTGTAATTTTTCATGTATTCATTTGCAGTCTTTGTATTT 
3401 TAACA?OTACACAaiTTAAAAAGTACATAAGTAAAC»^ 

5' 61 71 81 91

5' 61 71 81 91



5' 11 21 31 41

TTTAATGGATATTCTTTGGTGTGAATAATTGAATACAACATTmAA^ 

^^oiJSttAcctataagaaaccacacttattaacttato 

5' 61 71 81 91

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTTTTOITT^ 

3751 ^I^I^I^Tim^ 

5' 11 21 31 41

3 801 JJtaaggtctctaatttctgtgatc^^ 
Figure 12 continued



5' 11 21 31 41



5 ' J.J. <6X •»•«. 

ACaiTGCa^GTCACTTTCCTGATTGCrcn'CACAT 
35°^TGTAC»TCyVGTGAAAGGACTAACGAGAAGTGTAGGAGTTCCX3AGGCCrr^^ 

5' 61 71 81 91

TCCGGGGGTGTGGTGGGCTTTGATCTCAGGACTCTGGAGGCAGAAGCAGG 
^^^■^ AGGCCCCCACACCACCCGAAACTAGAGTCCTGAGACCTCCGTCTTCX3TCC 

5' 11 21 31 41

CAGATCTCTGTGAATATGAGGCCAGCCTGCACTACACAGAGCTCCAGACC 
4001 GTCTAGAGACACTTATACTC CGGTCGGACGTGATGTGTCTCGAGGTCTGG 

5' 61 71 81 91

AGTCATGGCTACATCATGAAACCCTGTCTCAAAAAG AAAATAAAAA CTGT 
4051 TCAGTACCGATGTAGTACITTGGGACAGAGTTTTTCrrrrArrrriX^ 

5' 11 21 31 41

TGTGTTTCTACCATAGTGTTAAACrCAGAGTCTGAGTAATGTCGGGCTGA 
* ^ ° ACACAAAGATGGTATCACAATTTGAGTCTCAGACrCaiTTACaUSCCaSACT 

5' 61 71 81 91

(aVTGCTCGGGTGlTTAACaiTACCTTCAGCTTTGACGAGGCGCTQAACAGT 
4151 GTACGAGCCC ACAAATTGTATGGAAGTCGAAACTGCTCCX3 CGACTTGTCA 

5' 11 21 31 41

CAAAGTCTGG CCTTGGGGAG CGGTGGCTGTGTTTGTGCTCT^GTCCACCG 
420lQ^^Q^Q;^CCGGAACCCCTCGCaiCCGACACAAACACGAGTTCAGGTGGC 

71 81 91 

•TGGACAACCGTG 
ACCTGTTGGCAC 


5' 11 21 31 41

CATGCAACCT CCAACTTCATGTTGGTCATTTTGTGAAAACACTGTGTGAT 
4301 GTACGTTGGAGGTTGi^GTACAACCSlGTAAAACAGTTTTGTGACACACTA 

5' 61 71 81 91

GTTTTTATCAATATACTGCC ATTCCACATATGTAGA6ATG TAGTCTGCCT 
43 51 CAAAAATAGTTATATGACGGTAAGGTGTATACATCTCTACATCAGACGGA 

5' 11 21 31 41

GGCTTTCCTTTTCTTTAGCCAATCGAATGCTCTTGATCATGCCCTCAATC 
°^ CCGAAAGGAAAAGAAATCGGTTAGCTTACGAGAACTAGTACGGGAGTTAG 
Figure 12 continued



5' 61 71 81 91 

4551 CATTCAGTTTCTCGTACTCCTCCATGTaVAAGTCACTGACACACTCATCG 
"» 3 3 J. GTAAGTCAT^GAGCATGAGG AGGTACAGTTTCAGTGACTGTGTGAGTAGC 

5' 11 21 31 41

- ^ o T TCATTGGTGTAGGAAAGCTG CTCTTTGGTAATCAGTTCCTTTAGCCAGGA 
* "-^ AGTAACCACATCCTTTCGACGAGAAACCATTAGTCAAGGAAATCGGTCCT 

5' 61 71 81 91 

.gc, GATTGTTTTGTTCACACTGTCTACCCCTGAACCACATACCTGGAAAACTG 
* " CTAACy\A;^C7^GTGTGACAGATGGGGACTTGGTGTATGGACCTTTTGAC 

5' 11 21 31 41

. „Q. TGTGCTCTATTTTCTTTTCCAAAACCAGGGTGTTCrTTTTGGGGGAAGCT 
ACACGAGATAAAAGAAAAGGTTTTGGTCCCACTUVGAAAAACCCCCTTCGA 

5' 61 71 81 91 

4751 TGCTTGGGAAAGCCAAGAAAGGCTAT^GAGAAAATGGAAATTAATGTTTC 
ACQAACCCTT TCGGTTCTTT CCGATTTCTC TTTTACCTTT AATTACAAAG 

5' 11 21 31 41 

TTTTACTCCC TTCAACATCAAGGTTAGGAATAT6TATTTC ATAAAAGCTA 
AAAATGAGGGAAGTTGTAGTTCCAATCCTTATA(aTAAAGTATTTTa3AT 

5' 61 71 81 91

> « c 1 ACAACTCACAGGCAATCTTAGACATCACTGACTGCTTGGC AGGCGACTGC 
« o a X TGTTGAGTGT CCGTTAGAAT CTGTAGTGAC TGACGAACCGTCCQCTGACG 

5' 11 21 31 41

4 TTGGGGGGAGCTGGAGAGCCTTCTCTTTCTTTCATGTTGTCX3TAAAAAAA 
* ^ " AACCCCCCTC GACCTCTCGG AAGAGAAAGAAAGTACAACAGCATTTTTTT 

5' 61 71 81 91 

.gen TTGCT^GAATATGGGGCTGGAAGATAACa^CTTTAACrCTCTTCaiCAGCCT 
AAC6TCriTATACCCCGACCTTCTATTGTTGAAATTGAGAGAAGTGTCX3^ 

5' 11 21 31 41

_ _ . GCACTGATTTTTTCTGGACAAATTCTTCAATGGCATCTAT TATCGCTTTT 
3 u u i CGTGACTAAAAAAGACCTGT TTAAGAAGTT ACCGTAGATAATAGCGAAAA 

5' 61 71 81 91 

CO CI GCTACTACGTTTGGGTCCTGTTGAGCATTTCCTTCAT^AAACAAAAAAAGC 
c^ATGATGCA/yi^CCCaiGGACAACTaSTAAAGGAAGTTTTTGTTTT^^ 

5' 11 21 31 41 

0- ACATTTTTATlAAAGTCAAGGTTAAGATCCACCTGCSU^AAAAAAGCTGa^ 
3 X u X TGTAAAAATT TTTCAGTTCC AATTCTAGGT GGACGTTTTT TTTCGAC6TT 

5' 61 71 81 91

5151 TATAAGCGAGGAATTCTAGT TGTCACAGGAAATAAAAATC TCTGTTCCCA 
ATATTCraCTCCTTAAGATCAACAGTGTCCTTTATTTTTACAGACAAGGGT 
Figure 12 continued



5' 11 21 31 41 

P _ „ . CTATAATCAATGTAGACTGATAATATTATG CCAGCAAATAGTTTTGAAGT 
""^ GATATTAGTTAOlTCnX3ACTATTATAATACGGTCX3TTTATCAAAACrrTCA 

5' 61 71 81 91

_ - ^ 1 CCTAGGCACAGTGGGAGGAGGTTTTGTTCCACGCTGTTCATAAGCCAATA 
^■^^■^ GGATCCGTGT CACCCTCCTC CAAAACAAGG TGCGACAAGTATTCGGTTAT 

5' 11 21 31 41 

c^n-i CCCCAGCT^AAAGACCTTAAAGGACAACniXSTAATTTGGQACATTCAC^ 
^•^ GGGGTOSTTTTCTGGAATTTCCTGTTGAACATTAAACCCTGTAAGTGTAQ 

5' 61 71 81 91

TGTCCTCTTCATCTGATCTGGCTCCCAGTGTCACTCTCTAACACGGTCCT 
^ ^ ACAGGAGAAG TAGACTAGAC CGAGGGTCAC AGTGAGAGAT TGTGCCAGGA 

5' 11 21 31 41

_ . TAGAGGGACyiATTTATCCCTGCCTCTGCTTGATCTTATGCATGTATCTGT 
•j 4 W -L ATCTCCCTGTTAAATAGGQACGGAGACGAACTAGAATACGTACATAQACA 

5' 61 71 81 91

- . - . ATTCTTCCAG CCATCCCTGG CGACCTGATTTTTCTAAGGC ACCCAAAACT 
^^^•^ TAAGAAGGTCGGTAGGGACCGCTGGACTAAAAAGATTCCGTGGGTTTTGA 

5' 11 21 31 41 

c e m GTAAGCTACTTCTTATAATCTATAATTCTG AGCATATTAQTTAGCCTQAG 
^^"■^CATTCGATGAAGAATATTAGATATTAAGACTCGTATAATCAATCGGACTC 

5' 61 71 81 91 

ccK-i CCTCCAGQAT ATCTTTCTTC CCTATACTCAGTCCAGTTTTAGCTGCCCAG 
= * = ^ GGAGGTCCTATAGAAAGAAG GGATATGAGT CAGQTCAATATCQACXSGGTC 

5' 11 21 31 41

c c m AAGGATTCaULAGCTGATCTACGAGTAGATCACTCCTGTCTACAGCTTGTT 
i> b Ul TTCCTAAGTTTCGACTAGATGCTCATCTAGTGAGGACAGATGTCGAACAA 

5' 61 71 81 91 

c c e 1 CCAGATCTTGTTTCTCAAGC CCTGGAAGCC ATCAGCCAGG TAAGATTGTA 
= = ^ GGTCTAGAAC AAAGAGTTCG GGACCTTCGG tAGTCGGTCC ATTCTAACAT 

5' 11 21 31 41 

_ ^ n . AAACAATCCCTTTCTAATCATGGGTGTGGC CCAAAGTGAATGGCCGGAAT 
5701 TTTGTTAGGG AAAGATTAGT ACCCACACCG GGTTTCACTT ACCGGCCTTA 
Figure 16



PDE10a and RACEs compiled

1 CGCCCGGGCA GGTCTGTTGG AGGGCAGTTG GTCAACCTGA CCAGAGAGAG CTGAGCTGGA 
GCGGGCCCGT CCAGACAACC TCCCGTCAAC CAGTTGGACT GGTCTCTCTC GACTCGACCT
61 AGACCCCACT GATGGTGTGC TGCCTTTCAG TCCAGGAAGA AAGAAAGGAA GGATTCTGAG
TCTGGGGTGA CTACCACACG ACGGAAAGTC AGGTCCTTCT TTCTTTCCTT CCTAAGACTC
121 GATTTGGGCA AAGCCACATT CCTGGAGAAG TCTGTATACT GATGCCAAAC CCAAGAGCTG
CTAAACCCGT TTCGGTGTAA GGACCTCTTC AGACATATGA CTACGGTTTG GGTTCTCGAC
181 AGCTGCTGAT GAGGCCCAGG GAGTAGCCCA CGCGCCCTGA GCTGTTGGCT AGCAAGGCCT
TCGACGACTA CTCCGGGTCC CTCATCGGGT GCGCGGGACT CGACAACCGA TCGTTCCGGA
241 TCCTGCTCCA TGTGGCATGG AAAAATTATA TGGTTTGACG GATGAAAAGG TGAAGGCCTA
AGGACGAGGT ACACCGTACC TTTTTAATAT ACCAAACTGC CTACTTTTCC ACTTCCGGAT
301 TCTTTCTCTC CATCCCCAGG TATTAGATGA ATTTGTTTCT GAAAGTGTTA GTGCAGAGAC
AGAAAGAGAG GTAGGGGTCC ATAATCTACT TAAACAAAGA CTTTCACAAT CACGTCTCTG
361 TGTGGAAAAG TGGCTGAAGA GGAAAACCAA CAAAGCAAAA GATGAACCAT CTCCCAAGGA
ACACCTTTTC ACCGACTTCT CCTTTTGGTT GTTTCGTTTT CTACTTGGTA GAGGGTTCCT
421 AGTCAGCAGG TACCAGGATA CGAATATGCA GGGAGTCGTG TACGAGCTGA ACAGCTACAT

TCAGTCGTCC ATGGTCCTAT GCTTATACGT CCCTCAGCAC ATGCTCGACT TGTCGATGTA

481 AGAGCAGCGC CTGGACACGG GCGGGGACAA CCACCTGCTC CTCTATGAGC TCAGCAGCAT

TCTCGTCGCG GACCTGTGCC CGCCCCTGTT GGTGGACGAG GAGATACTCG AGTCGTCGTA

541 CATCAGGATA GCCACAAAAG CCGACGGATT TGCACTGTAC TTCCTTGGAG AGTGCAATAA
GTAGTCCTAT CGGTGTTTTC GGCTGCCTAA ACGTGACATG AAGGAACCTC TCACGTTATT
601 TAGCCTGTGT GTGTTCATAC CACCCGGGAT GAAGGAAGGC CAACCCCGGC TCATCCCTGC
ATCGGACACA CACAAGTATG GTGGGCCCTA CTTCCTTCCG GTTGGGGCCG AGTAGGGACG
661 AGGGCCCATC ACCCAGGGTA CCACCATCTC TGCCTACGTG GCCAAGTCTA GGAAGACGTT
TCCCGGGTAG TGGGTCCCAT GGTGGTAGAG ACGGATGCAC CGGTTCAGAT CCTTCTGCAA

£ooRV . /?y^. ^ . - . . 

721 GTTGGTAGAG GATATCCTTG GGGATGAGCG ATTTCCTCGA GGTACTGGCC TGGAATCAGG 
CAACCATCTC CTATAGGAAC CCCTACTCGC TAAAGGAGCT CCATGACCGG ACCTTAGTCC
781 AACCCGCATC CAGTCTGTTC TTTGCTTGCC CATTGTCACT GCCATTGGAG ACTTGATTGG
TTGGGCGTAG GTCAGACAAG AAACGAACGG GTAACAGTGA CGGTAACCTC TGAACTAACC
841 CATCCTTGAA CTCTACAdG^^ AGAGGCXTTTC TGCCTCAGCC ATCAGGAGGT 

GTAGGAACTT GACATGTOCG TCA OCCCGrTT T CTCCGGAAG ACX5 G AGTCGG TAGTCCTCCA 

9oi "tgcaacagccT aatcttgctt gggcttccgt agcaatacac caggtgcagg tgtgtagagg 

ACX5TTGTCGG T TAGAA CGAA CCCGAAGGCA T0GTTATGT6 CTqCAqSTCC ACA CATCTC C. 
961 TCTCGCCAAA CAGACCGAAC T<bvATGACTT CCTACTC6AC GTATCAAAGA CATACTTTGA 

AGAGCGGTTT GTCTGGCTTG ACTTACTGAA GGATiUlGCTG CATACTTTC T GTATGAAACT 
1021 TAACATACTT G^^ ATATATGCAA AAAATCTAGT 

ATTGTATCAA CGGTATCTGA GAGATGAACT TGTGTAG TAC T ATATA CGTT TTTTAGATCA 
1081 GAACGCCGAC Vg^ AACAAGGAGC TGTACTCGGA 

C TTGCGGC TG GCGACGCGCG A GAAGGTCCA CCTGGTGTTC TTG TTCCT CG ACATGAGCCT 
"ilii ' CCTGTTTGAC ATTGGGGAGG AGAAGGAGGG GAAGCCCATC TTCAAGAAGA CCAAGGAGAT 

GGACAAACTG TAACCCCTCC TCTTCCTCCC CTT^S^^J.^G ^AGTTCTTC 
1201 CAGATTTTCC ATTCAG^ AGAACAGGCG AAGTCTTGAA 

GTCTAAAAGG TAACTCTTTC CCTAACGACC AGTTCACCGT TCTTGTCCGC TTCAGAACTT 

1261 TJattcccgaF gcctacgcgg'accctcgctt taacagggag GTGGACCTGT ACACAGGCTA 
gtaagg gcta cggatgcgc c tgxksag<^yi attgtcxxnc cacc tggaca tgtgtccgat 

T32I CAciS GGCAGCGTGA TTGGCGTGGT 

GTGGTGCTCC TTGTAA6ACA CATACGGGTA TCACTCGGCT CCGTCGCACT AACCGCACCA 
Figure 16 (con't)



PDE10a and RACEs compiled
GCAGATGGTG AACAAGATCA GCGGTAGCGC CTTCTCCAAG ACAGACGAGA ACAACTTCAA
CGTCTACCAC TTGTTCTAGT CGCCATCGCG G^AGAGGTTC TGTCTGCTCT TGTTGAAQTT 
— — • . . 

GATGTTTGCT GTCTTCTGC6 CACTGGCCTT GCACTGTGCT AACATGTACC ACAGGATCCG 
CTACAAAOGA CAGAAGACGC GTGACCGGAA CGTGACACGA TTGTACATC^^ 

CCACTCAGAA TGCATCTACA GGGTTACCKT GGAGAAGCTT TCCTACCACA GCATCTGCAC 
GGTGAGTCTT ACG TAGATGT CCCAAT6GTA.<yTCTTffAA CGTAGACGTG 
6t6cGA«SAG TGTOAAGGOc' t^^TGCGCTT CAACCTACCA GCACGCATCT GCCGGGACAT 
GaScTCCTC .>rvv:rmnCG AGTACGCGAA GTTGGA TGGT CGTGCGTAGA CGGCCCTGTA 

'■SgA^ATTC CACTTTGACa'ttGGTCCT"' CGAGAACATG IGeoCTGGGA TCTTT6TCTA 
GCTo SriSG GTGAAACTGT AAC CAGgAAAJjCTCCTG TAC AOQGGAOCCT AGAAACAGAT 

"CATGATOCAT CGGTCTTGTG GGACATCCTG TTTTGAACTT GAAWATTGT GCCGTTTTAT 

S^nr^n ^^^j^^^^ >^>r^Ac aaaacttgaa crrrryAACA oggcaaaata 

"" CATGTCTCTG AAGAAGA&CT A TCGGOGGGT TOCTTACCAC AACTGGAAGC ATGCAGTCAC 
^S^^r^ TAGOOGOOCA AGGAATGGTG TTGACCTIOG TAOGTCACTG 

crrCGCACAC TGCAIGTATG OCATACTXCA AAACAACAAT C^GCXTCTTCA CAGApCTCGA 
ggggggg J^^Z ggATGAAGT TTXGTTGTTA OOGGAGAAGT GtCTGGAGCT 

jgrg 

• Zl^IS^r'^ ogarCTOCAr CCTTCAGCTG GA AGGGCACA ATATCCTCgC 

«^a««5&GT ACGAGCAGGT GCIGGAGATC ATOCGCAAAG OCATCATOGC 

.^^^m/^ft-v^n rSATGAAGAAG CTGGGCATAC A60CCATTCC 

sigS'^sjs 
Figure 16 (con't)



PDE10a and RACEs compiled



2701 CGTCGCATAT CCATGTGAAG CAGACGACTC CqTGCTTGCC 6CACACACCT CGGACAGTGA 

GCAGCGT ATA GGTACACT TC GTCTGCTGAG Gg.ftgg^Fg^ ^^^9^-^^-°^^ GCCTGTCACT 
2761 GCAACCCAGg" CTCTGCCGTG TTCA6ACGTC GGCTACTCCG TGGCTCX7VCC TGACCTCCGA 

CGTTGGGTCC GAGACGG CAC AAGTCTGCAG CCGATGAGGC ACCGAGGTGG A CrGGAGGCt 
2821 MGCTAiiTG'"cTCOCAGG^ AGCACTGCAC TGTCT6GAGG GGGCAGAGAC CACA6GAGAG 

TAC GATAAAC GAGGGTCCGG TCGTGACGTG ACAGACCTCC CCCGTCTCT6 GTGTCCTCTC 
2881 GTTCTTGCCT GCATCCTCCC ATGAGGGTGT GGCCAGTTCC CTAGTTCTCT GCCATGCTGC 

CAAGAACGGA CGTAGGAGGG TACTCCCACA C CGGTC AAG6 GATCAA GACA C GGTACGA CG 
2941™ TGCTTGGTGG CATTGGTTAG GAATGG6ACA CACGCCCCTT GTTGTGAAGT' TTACATGTGA 

ACGAACCACC GTAACCAATC CTTACCCT GT GTW6GG6M JCAACACTI^ A ATGTACACT 
300i" CCTTCTTATA 6GTTAACT«i GTTTGTGGCC* T6GACACATG TAATGAAGGT CACAGTCCAC 

GGAAGAATAT CCAATTGACT CAAACACCGG ACCTGTGTAC ATTACTTOCA GTGTCAGGTG 

3061 "aggtgAcaga gaaatccaaa ctgttgatta'caggtgcact acaggtatgc tctttcagtc 

TCCACTGTCT CTTTAGGTTT GACAACTAAT GT0CACGT6A TGTCXaiTACG AGAAAGTCAG 
3i2r~TATCTGG(3G6 CACATAGGTG AGTCXGCTOC ACTCAGAAUM AAGCaiTACCT CTGOOCTCAT 
ATAGACCXXX GTGTATCCAC TCAGACGAGG TGAGTCTTHN TTOGTATGGA GAOGGGAGTA 
■3I8I CCAGGGGACA CAGG6TACAT CCCAGGCATC GGGGAACTGA AGCTCTCACT TCAAACCATG 
GGTCCCCTGT GTCCCATGTA GGGTOCGTAG CCCCTTGACT TOGAGAGTGA A GTTTGGTAC 
3^41" TCAAAGAATT AAAACACCIC CCCTCCCCCT CACTGTAGCtt TXC6ACAACT GOGOCAATCC 
AGTT TCTTAA TTTTGTG6AG GG6AGGGGGA GTGACATOGG AAGCTgTTGA CGOGGTTAGG 
bToi" CTTTATACAA AGAAAATAAA AGTAAGGCAT ATAAATTTCC TCCAGCAAGC AAATCTTGTG 
GAAATATG TT TCTTTTATTT TC ATTOOSTA TAT TTAAAGG AGGTCGTTCG TTTAGAACAC 
■ GCTAAAAAAA AAGCATGT6A ATNNTAACAA* CNTCTAMAHT HTCMCHGHAT GTTATGGCAG 
CCA TTTTTTT TTCGTACACT TANNAtTGTT GMA(gffNTMA NAGNGHCNTA CAATACCGTC 

"aattttagtc aogtccaaaa caaaaacatt attccagaag atacctcatc ctatc<xtga 

TTAAAA TCAG TGCAGGTTTT G TTTTT CTAA T AAGGTCTTC TATGGAGTAG GATACGGAC T 
' AAGGCTCCAC AGCATGGCGT C^GTCTCCCa" GGGTTCTGAT CCGTCTCCTC' AOGGTGCAAT 
TT««AGGTG TCGTACOGCA GGCAGAGGGT CC CAAGACTA GGCAGAGGAG TGCCAOGTTA 
'aAGGCAGGAC AGAGAG6AGG GCTGCA6GGC TAOCACATTC AC0CAGAA6G TATCTCCTCT 
GTOCGTOCTG TCTCTCCTOC CGACGTCOCG ATGGTCTW^C TGgG^ 
«6cmcKG"MM0aWAi^"^ ATGCTGTATT GAATA6TTCT CT6TGTGACT 

GTGCTAAGTCJOTAGG^^ CCTTACGGTT TACaSACAWA J^TC)^ 

t?SSaa gccaggacac cctgagcctt tccnggggaa ctctaagga^ TSlSSn? 

AAGATCTCTT CGGTCCTGTG GGACTCGGAA AGGJLCCCCTT GAGAOT^ 

ac^iccotgggTattttcagg'a^^^ cggtcgttgt tctcactcgt 

JSSSSc CTAAaSvGTCC TATCGTACCT <nX?TCTCTAG. g«»g^^ 
r;^CC1^S6~WVGGAGAGAc'T6ACCAGAi^ CACTCACTCA GCACTCTGCA GGAGCAGGAG 
?SSIS? J??SSSg J?TGGTCTTT GTGAGTGAGT CGTGAGACGT.a|TCGTCC^ 
AAGATAcSr 'wiGATCJlj^ TTT6ATACAC CCAATACCAT ACACACAGGA 

JJSIS JlCTACTTAG AACCTATCTA AAACTATGTC^GTTAO^GTA.^^ 

gctt«g<S"t«»aagtct a cttccgcgct ctgacccacg gttgtagcgg 

Ig???CA6A TAAGTCAAA G ^GGCGOGA GACT<««5TGC CAACAT^ 
•aGTGGGCTGA ACACTGTAAC ACTGTACATG CGATTTCCCC A]f6GGqTTCT 'JJJJGJ^ 
TCACCCGACT TGTGACATTG TGACATCTAC gCTAAAGGGG TACCCGAAGA TTTTACAGTG 

CATCTCCTCC cctgctgtgt cctactocat ttactggtta caaggtgatg tcaacaasag 

^Z..^. AATCACCAAT GTTOCACTAC AGTTGTTCTC 



3361
3421
3481
3541
3601

3661
3721
3841
3901
3961
4021
Figure 16 (con't)



PDE10a and RACEs compiled

4081 AAGCTATCAC AACACCAGGG CTGTGCACAC GTGCACACAC ATGTATGCAC AAGCACACAG 

TTCGATAGTG TTGTGGTCCC GACACGTGTG CACGTGTGTG TACATACGTG TTCGT GTGTC 
4141 ATGTATGTAC AGCACACACA CACACACACA CCCCAAAAGG AGAGAAAAGG AAGAAAACAT 

TAC ATACATG TCGTGTGTGT GTGTGTGTGT GG GGTTT T CC TCTCTTTTCC TTCTTTTGTA 
4201 TTATAAAAAG CGACAGCTAC CCCATATCAA AATAGTCTTT CCTGTAGGAA ACAGGAGCTC 

AATATTTTTC GCTGTCGATG GGGTATAGTT TTATCAGAAA GGACATCCTT TGT CCTCGAG 
4261 TCCATAAGGA ATTATCATGA GTGTGTTCTC CCATCAGT6C ACTCTCCCAG GGGTGCTCAC 

AGGTATTCCT TAATAGTACT CACACAAGAG GCTAGTCACG TGAGAGGGTC CCCACGAGTG 
4321 TGAAGCTGGT CCACRTCTAT iuiACAGGW?A CACTGGCTGC AGCAAAAAGC CATTCGATCC 

ACTTCGACCA GGTGRAGATA TTT G TCCACT GTGACCGACG TCGTTTTTCG GTAAGCTAGG 
4381 ACACAAATTG ATCTTCTATC ATCTTGGAAT CTGAATTGCA GGGAGGAGCA GYATGTAAGA 

TGTGTTTAAC TAGAAGATAG TAGAACCTTA G ACTTAACGT CCCTCCTCGT CyTACATTCT 
4441 CGACCGTTTA ATTCAGGCAT TCCGAAGGCA. TGAGCGCATG GATTCTRTCA CCAAGCGTAT 

GCTGGCAAAT TAACTCCGTA AGGCTTCCGT ACTCGOG TAC CTAA GA RAGT GGTTCGCATA 
4501 AAAAGGACCC TGGCATTGGcTaAACCTAT^^ TTGCTGTAGA AGTAGGGATT 

TTTTC CTGGG ACCGTAACCC TYTGGATACT GCXnXaACAAA AACGACATCT TCATOOCTAA 
4561 TTACAGAAGT CTOCTTGRAT TTGCCCTGCC TGGGGCAGTT TTGCAGAGGA ACCTGCCAGA 

AATGTCTTCA GAGGAACRTA AACGGGACGG ACCCCGTCAA AACGTCTCCT TGGACGGTCT 
"4621 GATTTATT6G CTGGTCAGTC TCTTGTGAAA TAGTATCATG TGAGAAACAG TTTGTAGAAA 
CTAAATAAOC gACCAGTCAG AGAACACTTT ATCATAGTAC ACTCTTTGTC AAACATCTTT 

AAAACTATAC CTGGGAAGAC CTTTGCAACA TTGTTCCTTC CATGGGCCAA GACTCAGTTA 

TTTTGATATG GACCCTTCTG GA AACGTTGT AACAA GGAAG GTACCCGGTT CTGAGTCAAT 
4741 ggaggcataa"1vtctgcccgg AATAAACTAG GCCAGGATAC AGCCATGTTT AGTTAATAAT 

cctccgtat t tagacgggcc ttatttgatc cg gtcc tatg tcgg taca aa tcaattatt a 

4801 ttggttttag^ISattcacaca ggcaggattg gtttttttgt gtcttggcaa gtggagcata 
aaccaaaatc ttaa6tgtgt ccgltxrtaac cajyvaaaaca cagaa ccgtt cacc togtat 

4861* TCTAACATAc'7gGCATGG<5v""a^^ GTCTCACCAA 

AAATTGTAT G TCX^GTACCC T TAGGAC GGAG AATCGAA AAG GGTGGGAGAA CAGAGTGGTT 
4 921 cirriTTCTC TCCAAAGGTT TCCAOSAATT tCTCATTAAT GGCTGATGCA AACTTAGTGA 

CAAAAAAGAG AGGTTTCCAA AGGTCCTTAA AG AGTAATTA CCGACTACGT TTGAATCACT 
4981 "aTAATAATGA ATATAAACAA TGCTCACCTc'Aa^AVATTA TATTATTT<SC AGTCATTTGT 

TATTATTAC T TATATTTGTT ACGAGTGGAG TCGTgTTAAT ATAATAAACXS T CAGTAA^CA 
5041 GAWACACAA ATTTTATCGc" AATGGTTATT ATtTAATTTG TGGCCACACA CTGTGGTTAT 

CTATTGTGTT TAAAATAGCG TTACX^TAA TAftATT 
5101 CTOTIOTTgFgGTTGTT^^ GAGAAAATGr ra'TG(^ OTAAGTGCCA ATACCAGTGT 

GAAAACAACA CCAACAAAGA CTCTTTTACA AGA^CTATA CATTCA TATGGTCACA 

5161 gaagwvttS tc'tc^ GTAAACATCA ATTCTATCTC 

CTTCATAACT AG GGCCCGT C GTTTTATGTC GGATTCCAAA CAW 
5221 AGTTCATCAG AGGGCCTG AG*^ AAGCTGCGGG GCAGTGTiAA GTAAA CTGGGCTGGT 

TCAAGTAGTC TCCCGGACTC TTCGACGCCC CGTCACATTT CATTTCATAC GACC^^ 
5281 GGT6GTCA6C CTCCCCTTGC CAA<^ GCAATTGAAT CCTGTCCCCA GCTCCCTCCA 

CCACCAG TCG GAGGGGAACG GTT CTTCTC T CGTJAACTTA G^ 
5341 CGCCTGAAGA GTGACCAGTG CTGGCCCGAC GGATCGCTGA GATATTCTCC CATAATGGCA 

GCGGA CTTCT C ACTGGTCAC GACCGGGCT G CCTAGCGACT CTATAAGAGG GTATTACCGT 
5401 * AAAAAATAGG "EAGrTTGATG TCACCTGTTT 'aGTCTGGCTC TCCTCTTTTG AGCATGTGTT 

TTTTTTATCC GTCAAACTAC ACTGGACA AA TCACACCGAG AGGAGAAAA C TCGTACACAA 
PDE10a and RACEs compiled

54 61 AGCATTTTTA TTTTATACTC ATCCAGTGAA CTCTGCTCTT CCAAGTGTGT TCATGTATGT 
TCG TAAA AAT AAAATATGAG TAGGTC AC TT GAGACGAGAA GGTTCACACA AG TACATACA 
5521 GCTAGATATA TTAGCACAGC CTGCCTTCTG CTGCACAACG CCTTAGAGAC CCGGCCTTTC 
CGATCTATAT AATCGTGTCG GACGGAAGAC GA CGTGT TGC GG AATCTCTG GGCCGGAAAG 
5581 AATGAGCTTA GCTTGTGCTC TGTTTCTGCT CTCTTAGGTC TAAACTATGG TGTCAGTTTT 
TTACTCGAAT CGAACACGAG ACAA AG ACGA GAGAATCCAG ATTTGATOCC A CAGTCAAAA 
5641 AATAGAACAA AAGTATGCAT CTTGCCTTGG CTTGAGCCTT TTCGTTTTCA ATGCTGACTT 
TTATCTTGTT TTCATACGTA GAACGGAACC GAACTCGGAA AAGCAAAAGT TACGACT GAA 
5701 CTCCCCTTTC TCTCCTGTGC TCACCTTACC TTTCCAGAGT GTAAGGGACA ACTTTTAAGCS 
GAGGGGAAAG AGAGG ACACG AGTGGAATGG i^GGTCTCA C^TTCCC^ TG AAAATTCC 
5761 AGGCGTGTCC'CTGGTAGGGG CATCCCT6TT CACCAGGTGC CTGTCATCAC CCCACTTGAC 
TCCGCACAGG GAOCATCCCC GTAGGGACAA GTGGTCCACG GACAGTAGTG GGGT6AACTG 
TGACATCTAC CCTGGTGACT ATGGGTTCCT CTTGTTTGTA GGGAACGGTG GCtCCAGGTG 
ACTGTAGATG 6GACCACTGA TACCCAAGGA G AACAAACAT CCCTTGOCAC CGAGGTCCAC 
GAGGCATCAA TCTGTTGGGT TCTGGTTCCC GGCTGCCTTT GGTTTTGAAA GTCTCTTCTC 
CTCCGTAGT.T AGACAACXXA AGACC AAGGG CCXSAC GG AAA CCAAAACCTT CAGAGAAGAG 
TGTATATTCC TAOCCTGCAT tTGCTTTGTG TGGTGCTGAT GCTGTGGCAG TAGGATCTTG 
ACATATAAGG ATG GGACGTA AAC GAAACAC AC CACGACTA CX5ACAC0GTC ATOCTAGAAC 
oJtGACTCTC CATCAGTCAC AGACX<Xxicc"TGl^CA^ TGTCAGGCTG ACTCX3ACAGT 
CTACTGAGAG GTAGTCAGTG TCT6AGGGGG ACAAOGTTTC ACAGTCCGAC TGAGCTGTCA 
"ioiT CACCGTAAAA TCTGAGTCAG TCACACACAG GCTGTCAGCC ACGGCTTCCA CTTGCATGGC 
GTGGCATTTT AGACTCAGTC AGTGTGTGTC CGACAGTCGG TGCCGAAGGT GAACGTACCG 
'eiii TATTCTATTT TCACACGTGA GTTTCTGTTG CTGGCTGGCT GACTGGCATT ATCTATGCTA 
ATAAGAT AAA AGTGTGCACT CAAA GACAAC GACCGACCGA CTGAOCGTAA TAGATACGAT 
6181 ACTTGAAATC AGGAGTGTGC CCA6CAGAGC CCATCATXCT CACTGTCTTT GAAACAAAGC 
TCAAC TTTAG TCCTCACACG GGTC6TCTCG ffTAGWAGA GT GACAGAAA CTTTGTTTCG 
6241 "i^ACGGTrf GATCGATGAA "cXfTATTTAAA GCATTTCATG CAATGACAAA GTGCTCAGTA 
ACATGCCAAA CTAGCTACTT GCA TAAATTT CGTAAAGTA C G TTACTGTTT CAC GAGTCAT 
€301 'gtggaaggca GGCTGTGACC AGTCTGCCTG CTCCTTACTA TAATTGTGAG GATTTGTTAC 
CAC CTTCCG T CCgACACTGG TCAGACGGAC GAGGAATCM ATTAACACTC CTA AACAAT G 
6361 TGGAACAGTA CATGGAGGCC TGACCri^ GTGGAACCTT AGCTGAATAT 

ACCTTGTCAT GTACCTCCGG ACTGGAACAC CCCCgTCTCC <^CCTTGGAA TCGACTTATA 

6421 iGTGTGTGTCT^ 

TCA CACACAG AGTTCT CCTT CAGTCCCATG ATTCAGTCA^^ 
6481 TATACATTTG^CCCGTTTTAf CTCTAATGTG /UlATAAATO^ CCAAACACTT GTTTATCGTG 

ATATG TAAAC GGGCAAAATA GAG ATTACAC TTTATTTAGG GGTTTGTGAA CAAATAGCAC 
6541* TA6C6TACCT AAAAGACTAT TCTATTATGg' GTGTCCCCAC TTTCTTGGTT ^GGTCACCCC 

ATCGCATGGA TTTTCTGATA AGATAATACC CACAGGGGTG AAAGAACCAA ACCAGTGGGG 

- xw ' 

6601 GATCCCCCGG TCTTCTGCTG TATCTAGAAC AGTGACTATA AATGAT6TAT GGGAATAGTG. 

CTAGGGGGCC AGAAGACGAC ATAGATCTTG TCACTGATAT TTACTACATA CCCTTATCAC 
6661 TTTCCATATG ATCTGrro^^ CATTTACTGT ACAAAAACCC 

AAAGGTATAC TAGACAACAG AC CTC ATATA CGA WA<»A GTAA^ TGTTTTTGGG 
672 1 A6TGCAGCTG ATGATGCyvAA GCAGKn-CTC" TC^ GTGOCCCAOC TATTTAAAAA 

TCACGTCGAC TACTACGTTT CGTCAGAGAG AOACACATGT CAOWSGCTG6 ATAAAI^^ 
678 1 tSSctacaaIioccagaa^ CTTAACATAA GAAACAAACG CAGCGTCTGG 

AGTGCATGTT HGGGTCTTGT GACACTTTGT GAATTGTATT CTTTCTTTGC GTCGCAGACC 
PDE10a and RACEs compiled

6841 ATTCTTTCCA AGGAGAGCAG CTTTCTCCAC AGGAACACAG TAACAAAAGA GGTCCGCCGC 
TAAGAAAGGT TCCTCTCGTC GAAAGAGGTG" TCCTTGT6TC ATTGTTTTCT CCAGGCGGCG 

6901 CATCCACACC CAGCCAAGAC ACCTCAGAGG CCATAGGGAC AACCTCCTTG CTGGCCAACA 
GTAGGTGTGG GTCGGTTCTG TGGAGTCTCC GGTA TCXX:TG TTGG AGGAAC GACCGGTTGT 

6961 cctgctggag cagggcacag gtccca'gcaa ctgatcctca gtggatgggt'"ccgcagtcaa 
ggacgacctc gtcccgtgt c cagggtcgtt gactaggagt cacctaccca ggcgtcagtt 

. - — ^ - toW 

7021 AGCCTTAATG GGCTCTCTTT TGAAGGGGAA AGAAANNTTT CAAGCTTATG ATATCCAACA 

TCGGAATTAC CCGAGAGAAA ACTTCCCCTT^ T?HT^??^^:^ GTTCGAATAC TATAGGTTGT 
7081 TTATTATAGT TGATGAGTTA 6TAAATTCCG AAAAAAAAAG ATGATTTTAT ATGTATGACA 

AATAATATCA ACTACTCAAT CATTTAAGGC TTTTTTTTTC TACTAAAATA TACATACTGT 
ilAl TAAAAAAAAT CTTTGTAAAg' TGCGCAAGTG OVATmStTA AAGAGGTCTT^ATCTTTGCAT 

ATTTTTTTTA GAAACATTTC A0GC6TTCAC G TTAT TAAAT TTCT CCAGAA TAGAAAOGTA 
7201 ttaSvaatta taaatattgt*"acatgtgtgt AATTTTTCAT GTATTCATTT GCAGTCTTTG 

A ATATTTAAT ATTTATAACA TGTACACACA T TAAAA AGTA CATAAGTAAA CGTCAGAAAC 
7261 TATTTAAAAA AACTTTACTG TTATGTTTGT ATAATAGAAC ATTAATCATT TATTATAACT 

ATAAATTTTT TTGAAATGAC AATACAAACA TATTAT C TTG TAATTAGTAA ATAATATTGA 
"7321 CAGACAAGGT GTAAATAAAT TCATAATTCA AACAGCXZAGT ATATATGCAT ATATGGGTGT 

GTCTGTTCC A CATTTATTTA AGTATTAAGT TTGTCGGTCA TATATACGTA TATACCCACA 
7381 TACATTGCAA AAATCTCTAT CTTTGTTCTA TTCACATGCT TAAAGAAGTA AGAAATCTTT 

ATGTAACGT T TTTAGAGATA G AAA CAAG AT AAGTGTACGA A TTTCTTCAT TCTTTAGAAA 
' 7441 TGTGGATATG TAATTATACA TATAAAGTAT ATATATATGT ATGATACATG AAATATATTT 

ACACCTATAC ATTAATATGT ATATTTCATA TATATATACA TA CTATGTAC TTYATATAAA 
7501 AGAAATGTTC ATAATTTTAA isGAWTTCT TTGGTGTGAA TAATTGAATA CAACATTTTT 

TCTTTACAAG TATTAAAATT J^CCTATAAGA AACCACACJ? ATTAACTYA T GTTGTAAAAA 
7561 AAJUITC^^ C 

TTTTACTTTT txTTTTTTTT G 
Figure 19

PDE10A compiled

1 CGCCCGGGCA GGTCTGTTGG AGGGCAGTTG GTCAACCTGA CCAGAGAGAG CTGAGCTGGA
GCGGGCCCGT CCAGACAACC TCCCGTCAAC CAGTTGGACT GGTCTCTCTC GACTCGACCT
61 AGACCCCACT GATGGTGTGC TGCCTTTCAG TCCAGGAAGA AAGAAAGGAA GGATTCTGAG
TCTGGGGTGA CTACCACACG ACGGAAAGTC AGGTCCTTCT TTCTTTCCTT CCTAAGACTC
121 GATTTGGGCA AAGCCACATT CCTGGAGAAG TCTGTATACT GATGCCAAAC CCAAGAGCTG
CTAAACCCGT TTCGGTGTAA GGACCTCTTC AGACATATGA CTACGGTTTG GGTTCTCGAC
181 AGCTGCTGAT GAGGCCCAGG GAGTAGCCCA CGCGCCCTGA GCTGTTGGCT AGCAAGGCCT
TCGACGACTA CTCCGGGTCC CTCATCGGGT GCGCGGGACT CGACAACCGA TCGTTCCGGA
241 TCCTGCTCCA TGTGGCATGG AAAAATTATA TGGTTTGACG GATGAAAAGG TGAAGGCCTA
AGGACGAGGT ACACCGTACC TTTTTAATAT ACCAAACTGC CTACTTTTCC ACTTCCGGAT
301 TCTTTCTCTC CATCCCCAGG TATTAGATGA ATTTGTTTCT GAAAGTGTTA GTGCAGAGAC
AGAAAGAGAG GTAGGGGTCC ATAATCTACT TAAACAAAGA CTTTCACAAT CACGTCTCTG
361 TGTGGAAAAG TGGCTGAAGA GGAAAACCAA CAAAGCAAAA GATGAACCAT CTCCCAAGGA
ACACCTTTTC ACCGACTTCT CCTTTTGGTT GTTTCGTTTT CTACTTGGTA GAGGGTTCCT
421 AGTCAGCAGG TACCAGGATA CGAATATGCA GGGAGTCGTG TACGAGCTGA ACAGCTACAT
TCAGTCGTCC ATGGTCCTAT GCTTATACGT CCCTCAGCAC ATGCTCGACT TGTCGATGTA
481 AGAGCAGCGC CTGGACACGG GCGGGGACAA CCACCTGCTC CTCTATGAGC TCAGCAGCAT
TCTCGTCGCG GACCTGTGCC CGCCCCTGTT GGTGGACGAG GAGATACTCG AGTCGTCGTA
541 CATCAGGATA GCCACAAAAG CCGACGGATT TGCACTGTAC TTCCTTGGAG AGTGCAATAA
GTAGTCCTAT CGGTGTTTTC GGCTGCCTAA ACGTGACATG AAGGAACCTC TCACGTTATT
601 TAGCCTGTGT GTGTTCATAC CACCCGGGAT GAAGGAAGGC CAACCCCGGC TCATCCCTGC
ATCGGACACA CACAAGTATG GTGGGCCCTA CTTCCTTCCG GTTGGGGCCG AGTAGGGACG
661 AGGGCCCATC ACCCAGGGTA CCACCATCTC TGCCTACGTG GCCAAGTCTA GGAAGACGTT
TCCCGGGTAG TGGGTCCCAT GGTGGTAGAG ACGGATGCAC CGGTTCAGAT CCTTCTGCAA
721 GTTGGTAGAG GATATCCTTG GGGATGAGCG ATTTCCTCGA GGTACTGGCC TGGAATCAGG
CAACCATCTC CTATAGGAAC CCCTACTCGC TAAAGGAGCT CCATGACCGG ACCTTAGTCC
781 AACCCGCATC CAGTCTGTTC TTTGCTTGCC CATTGTCACT GCCATTGGAG ACTTGATTGG
TTGGGCGTAG GTCAGACAAG AAACGAACGG GTAACAGTGA CGGTAACCTC TGAACTAACC
841 CATCCTTGAA CTGTACAGGC ACTGGGGCAA AGAGGCCTTC TGCCTCAGCC ATCAGGAGGT
GTAGGAACTT GACATGTCCG TGACCCCGTT TCTCCGGAAG ACGGAGTCGG TAGTCCTCCA
901 TGCAACAGCC AATCTTGCTT GGGCTTCCGT AGCAATACAC CAGGTGCAGG TGTGTAGAGG
ACGTTGTCGG TTAGAACGAA CCCGAAGGCA TCGTTATGTG GTCCACGTCC ACACATCTCC
961 TCTCGCCAAA CAGACCGAAC TGAATGACTT CCTACTCGAC GTATCAAAGA CATACTTTGA
AGAGCGGTTT GTCTGGCTTG ACTTACTGAA GGATGAGCTG CATAGTTTCT GTATGAAACT
1021 TAACATAGTT GCCATAGACT CTCTACTTGA ACACATCATG ATATATGCAA AAAATCTAGT
ATTGTATCAA CGGTATCTGA GAGATGAACT TGTGTAGTAC TATATACGTT TTTTAGATCA
1081 GAACGCCGAC CGCTGCGCGC TCTTCCAGGT GGACCACAAG AACAAGGAGC TGTACTCGGA
CTTGCGGCTG GCGACGCGCG AGAAGGTCCA CCTGGTGTTC TTGTTCCTCG ACATGAGCCT
1141 CCTGTTTGAC ATTGGGGAGG AGAAGGAGGG GAAGCCCATC TTCAAGAAGA CCAAGGAGAT
GGACAAACTG TAACCCCTCC TCTTCCTCCC CTTCGGGTAG AAGTTCTTCT GGTTCCTCTA
1201 CAGATTTTCC ATTGAGAAAG GGATTGCTGG TCAAGTGGCA AGAACAGGCG AAGTCTTGAA
GTCTAAAAGG TAACTCTTTC CCTAACGACC AGTTCACCGT TCTTGTCCGC TTCAGAACTT
1261 CATTCCCGAT GCCTACGCGG ACCCTCGCTT TAACAGGGAG GTGGACCTGT ACACAGGCTA
GTAAGGGCTA CGGATGCGCC TGGGAGCGAA ATTGTCCCTC CACCTGGACA TGTGTCCGAT
1321 CACCACGAGG AACATTCTGT GTATGCCCAT AGTGAGCCGA GGCAGCGTGA TTGGCGTGGT
GTGGTGCTCC TTGTAAGACA CATACGGGTA TCACTCGGCT CCGTCGCACT AACCGCACCA
Figure 19 (con't)

PDE10A compiled

1381 GCAGATGGTG AACAAGATCA GCGGTAGCGC CTTCTCCAAG ACAGACGAGA ACAACTTCAA 

CGTCTACCAC TTGTTCTAGT CGCCATCGCG GAAGAGGTTC TGTCTGCTCT TGTTGAAGTT 
1441 GATGTTTGCT GTCTTCTGCG CACTGGCCTT GCACTGTGCT AACATGTACC ACAGGATCCG
CTACAAACGA CAGAAGACGC GTGACCGGAA CGTGACACGA TTGTACATGG TGTCCTAGGC
1501 CCACTCAGAA TGCATCTACA GGGTTACCAT GGAGAAGCTT TCCTACCACA GCATCTGCAC
GGTGAGTCTT ACGTAGATGT CCCAATGGTA CCTCTTCGAA AGGATGGTGT CGTAGACGTG
1561 CTCCGAGGAG TGGCAAGGCC TCATGCGCTT CAACCTACCA GCACGCATCT GCCGGGACAT
GAGGCTCCTC ACCGTTCCGG AGTACGCGAA GTTGGATGGT CGTGCGTAGA CGGCCCTGTA
1621 CGAGCTATTC CACTTTGACA TTGGTCCTTT CGAGAACATG TGGCCTGGGA TCTTTGTCTA
GCTCGATAAG GTGAAACTGT AACCAGGAAA GCTCTTGTAC ACCGGACCCT AGAAACAGAT
1681 CATGATCCAT CGGTCTTGTG GGACATCCTG TTTTGAACTT GAAAAATTGT GCCGTTTTAT
GTACTAGGTA GCCAGAACAC CCTGTAGGAC AAAACTTGAA CTTTTTAACA CGGCAAAATA
1741 CATGTCTGTG AAGAAGAACT ATCGGCGGGT TCCTTACCAC AACTGGAAGC ATGCAGTCAC
GTACAGACAC TTCTTCTTGA TAGCCGCCCA AGGAATGGTG TTGACCTTCG TACGTCAGTG
1801 GGTGGCACAC TGCATGTATG CCATACTTCA AAACAACAAT GGCCTCTTCA CAGACCTCGA
CCACCGTGTG ACGTACATAC GGTATGAAGT TTTGTTGTTA CCGGAGAAGT GTCTGGAGCT
1861 GCGCAAAGGC CTGCTAATTG CGTGTCTGTG CCATGACCTG GACCACAGGG GCTTCAGTAA
CGCGTTTCCG GACGATTAAC GCACAGACAC GGTACTGGAC CTGGTGTCCC CGAAGTCATT
1921 CAGCTACCTG CAGAAGTTCG ACCACCCCCT GGCGGCGCTG TACTCCACCT CCACCATGGA

GTCGATGGAC GTCTTCAAGC TGGTGGGGGA CCGCCGCGAC ATGATCTGGA GGTGGTACCT
1981 GCAACACCAC TTCTCCCAGA CGGTGTCCAT CCTTCAGCTG GAAGGGCACA ATATCTTCTC
CGTTGTGGTG AAGAGGGTCT GCCACAGGTA GGAAGTCGAC CTTCCCGTGT TATAGAAGAG
2041 CACCCTGAGC TCCAGCGAGT ACGAGCAGGT GCTGGAGATC ATCCGCAAAG CCATCATCGC
GTGGGACTCG AGGTCGCTCA TGCTCGTCCA CGACCTCTAG TAGGCGTTTC GGTAGTAGCG
2101 CACCGACCTC GCCCTATACT TTGGGAACAG GAAGCAGTTG GAGGAGATGT ACCAGACAGG
GTGGCTGGAG CGGGATATGA AACCCTTGTC CTTCGTCAAC CTCCTCTACA TGGTCTGTCC
2161 GTCGCTGAAC CTCCACAACC AGTCCCATCG AGACCGTGTC ATCGGCTTGA TGATGACTGC
CAGCGACTTG GAGGTGTTGG TCAGGGTAGC TCTGGCACAG TAGCCGAACT ACTACTGACG
2221 CTGTGATCTT TGCTCTGTGA CCAAACTATG GCCAGTTACA AAATTGACAG CGAATGATAT
GACACTAGAA ACGAGACACT GGTTTGATAC CGGTCAATGT TTTAACTGTC GCTTACTATA
2281 ATATGCAGAA TTCTGGGCTG AGGGTGATGA GATGAAGAAG CTGGGCATAC AGCCCATTCC
TATACGTCTT AAGACCCGAC TCCCACTACT CTACTTCTTC GACCCGTATG TCGGGTAAGG
2341 TATGATGGAC AGAGACAAGC GAGATGAAGT CCCTCAAGGG CAGCTCGGAT TCTACAATGC
ATACTACCTG TCTCTGTTCG CTCTACTTCA GGGAGTTCCC GTCGAGCCTA AGATGTTACG

2401 TGTGGCCATT CCCTGCTATA CCACCTTGAC GCAGATCCTC CCACCCACAG AGCCTCTGCT
ACACCGGTAA GGGACGATAT GGTGGAACTG CGTCTAGGAG GGTGGGTGTC TCGGAGACGA
2461 GAAGGCCTGC AGGGATAACC TCAATCAGTG GGAGAAGGTA ATTCGCGGGG AAGAGACAGC
CTTCCGGACG TCCCTATTGG AGTTAGTCAC CCTCTTCCAT TAAGCGCCCC TTCTCTGTCG

252i' ''aatgtggaTt^'t^^^ tagcaagagc acacctgaga agctgaacgt 

TTACACCTAAAGTC_CGGGTC_CGG^^^ TGTGGACTCT TCGACTTGCA 

2581 GAAGGTTGAA GACTGATCCT GAAGTGACGT CCTGATGTCT GCCCAGCAAC CGACTCAACC
CTTCCAACTT CTGACTAGGA CTTCACTGCA GGACTACAGA CGGGTCGTTG GCTGAGTTGG
2641 TGCTTCTGTG ACTTCGTTCT TTTTGTTTTC AAGGGGTGAA AACCCCCTGT CAGAAGGTAC
ACGAAGACAC TGAAGCAAGA AAAACAAAAG TTCCCCACTT TTGGGGGACA GTCTTCCATG
270l " CCTCGCATAT CCATGTGA^^ CAGACGACTc" CCTGCT GCACACACCT CGGACAGTGA 

GCAGCGTATA GGTA<y^CTTC GTCTGCl^^^^ CGTGTGTG6A GCCT6TCACT 
Figure 19 (con't)
2761 GCAACCCAGG CTCTGCCGTG TTCAGACGTC GGCTACTCCG TGGCTCCACC TGACCTCCGA
CGTTGGGTCC GAGACGGCAC AAGTCTGCAG CCGATGAGGC ACCGAGGTGG ACTGGAGGCT
2821 ATGCTATTTG CTCCCAGGCC AGCACTGCAC TGTCTGGAGG GGGCAGAGAC CACAGGAGAG

TACGATAAAC GAGGGTCCGG TCGTGACGTG ACAGACCTCC CCCGTCTCTG GTGTCCTCTC

2881 GTTCTTGCCT GCATCCTCCC ATGAGGGTGT GGCCAGTTCC CTAGTTCTGT GCCATGCTGC 

CAAGAACGGA CGTAGGAGGG TACTCCCACA CCGGTCAAGG GATCAAGACA CGGTACGACG

2941 TGCTTGGTGG CATTGGTTAG GAATGGGACA CACGCCCCTT GTTGTGAAGT TTACATGTGA

ACGAACCACC GTAACCAATC CTTACCCTGT GTGCGGGGAA CAACACTTCA AATGTACACT

3001 CCTTCTTATA GGTTAACTGA GTTTGTGGCC TGGGACACAT GTAATGAAGG TCACAGTCCA 

GGAAGAATAT CCAATTGACT CAAACACCGG ACCCTGTGTA CATTACTTCC AGTGTCAGGT

3061 CAGGTGACAG AGAAATCCAA ACTGTTGATT ACAGGTGCAC TACAGGTATG CTCTTTCAGT 

GTCCACTGTC TCTTTAGGTT TGACAACTAA TGTCCACGTG ATGTCCATAC GAGAAAGTCA

3121 CTATCTGGGG GCACATAGGT GAGTCTGCTC CACTCAGAAg' GAAGCAT^^ 

G^TAGACCCC CGTGTA^ CTCAGACGAG 6TGAGTCTTC CTTCGTATGG AGASGGGAGT 

3181 TCCAGGGGAC AGAGGGTACA TCCCAGGCAT CGOSGAACTG AAGCTCTCAC TT^^ 

, TTCGAGAGTG AAGTTTGGTA 

3241 GTCAAAGAAT TAAAACACCT CCCCTCCCCC TCACTGTAGC CTTCGGCAAC TGCGCCAATC

CAGTTTCTTA ATTTTGTGGA GGGGAGGGGG AGTGACATCG GAAGCCGTTG ACGCGGTTAG

3301 CCTTTATACA AAGAAAATAT AAGTAAGGCA TATAAATTTC CTCCAGCAAG CAAATCTTCT

GGAAATATGT TTCTTTTATA TTCATTCCGT ATATTTAAAG GAGGTCGTTC GTTTAGAACA

3361 GGGTAAAAAA AAAAAATGTG AATTTTAACA ACCTCTATAT TTTCACTGTA TGTTATGGCA

CCCATTTTTT TTTTTTACAC TTAAAATTGT TGGAGATATA AAAGTGACAT ACAATACCGT

3421 GAATTTTAGT CACGTCCAAA ACAAAAGATT ATTCCAGAAG ATACCTCATC CTATGCCTGA

CTTAAAATCA GTGCAGGTTT TGTTTTCTAA TAAGGTCTTC TATGGAGTAG GATACGGACT

3481 AAGCTCCACA GCATGGCGTC CGTCTCCCAG GGTTCTGATC CGTCTCCTCA CGGTGCAATC

TTCGAGGTGT CGTACCGCAG GCAGAGGGTC CCAAGACTAG GCAGAGGAGT GCCACGTTAG

3541 AGGCAGGACA GGAGGAGGTG CAGGGCTACC ACATTGACCC AGATGGTATC TCCTCTCACC

TCCGTCCTGT CCTCCTCCAC GTCCCGATGG TGTAACTGGG TCTACCATAG AGGAGAGTGG

3601 ATTCAGACAT CCATAAGGAA TGCCAAATGC TGTATTGAAT AGTTCTCCTG TGTGACTTTC 

TAAGTCTGTA GGTATTCCTT ACGGTTTACG ACATAACTTA TCAAGAGGAC ACACTGAAAG

3661 TAGAGAAGCC AGGACACCCC TGAGCCTTTC CTGGGAACTC CTAAGGAAGT CACAGGTTCA 

ATCTCTTCGG TCCTGTGGGG ACTCGGAAAG GACCCTTGAG GATTCCTTCA GTGTCCAAGT

3721 CACCGTGGGG ATTTTCAGGA TAGttTGGAG ACCAGAGAM TTCTTCTCAcT 

_ GTGGC ACCCC T AAAAG TCCT A TCGTACCTC TGGTCTCTTA GGGCCAAGCC AACAAGAGTG 

3781 TCGGTGAGCC TTGAGAAGGA AGAGACTGAC CAGAAACACT CACTCAGCAC TCTGGCAGGA

AGCCACTCGG AACTCTTCCT TCTCTGACTG GTCTTTGTGA GTGAGTCGTG AGACCGTCCT

3841 GCAGGAGAAG ATACTTTAAG ATGAATCTTT GGGATAGATT TTGATACACC CAATACCATA 
CGTCCTCTTC TATGAAATTC TACTTAGAAA CCCTATCTAA AACTATGTGG GTTATGGTAT 
3901 CACACAGGAG CTTGGCATTT GCAAAGTCTA TTCAGTTTGC TTCCACACTC TGACCCACGG 

GTGTGTCCTC GAACCGTAAA CGTTTCAGAT AAGTCAAAGG AAGGTGTGAG ACTGGGTGCC

3961 TTGTAGCGGA GTGGGCTGAA CACTGTAACA CTGTACATGC GATTTCCCCA TGGGCTTCTA
AACATCGCCT CACCCGACTT GTGACATTGT GACATGTACG CTAAAGGGGT ACCCGAAGAT
4021 AAATGTCACC ATCTCCTCCC CTGCTGTGTC CTACTCCATT TACTGGTTAC AAGGTGATGT

TTTACAGTGG TAGAGGAGGG GACGACACAG GATGAGGTAA ATGACCAATG TTCCACTACA

4081 C7UVCAAGAGA AGCTATCACA ACACCAGGGC TGTGCACACG TGCACACACA TCTATGCAoT 
GTTGTTCTCT TCGATA^^ AqACGTGTGC ACGTGTGTGT ACATACGTGT 
Figure 19 (con't)

PDE10A compiled

4141 AGCACACAGA TGTATGTACA GCACACACAC ACACACACAC CCCAAAAGGA GAGAAAAGGA 
TCGTGTGTCT ACATACATGT CGTGTGTGTG TGTGTGTGTG GGGTTTTCCT CTCTTTTCCT 
4201 AGAAAACATT TATAAAAAGC GACAGCTACC CCCATATTCA AAAATAGTTC TTTTCCCTGT 
TCTTTTGTAA ATATTTTTCG CTGTCGATGG GGGTATAAGT TTTTATCAAG AAAAGGGACA 
4261 AGGGAAACAG GTAGCTCTCC ATAAGGAAAT TATCATGAGT GTGTTCTCCC ATCAGTGCAC 

TCCCT TTGTC CATCGAGAGG TATTCCTTTA ATAGTACTCA CAC^ TAGTCACGTG 

4321 TTCTCCCAGG GGT6CTCACT GAAGCTGGTC CACGTCTATA AACAGGTGAC ACTGGCTGCA 

AA GAGGGT C C CCACGAG TGA CTTCG ACCAG GTGpVGATAT TTGTCCACTG TGACCGACGT 

4381 GCAAAAAGCC ATTCGATCCA CACAAATT Ga" TCTTC TCTTGGAATC TGAATTGCAG 

S9'!^7yTI299..'i^9StE!^9S'^ gtgtttaact agaagatagt apaaccttag acttaacgtc 

4441 GGAGGAGCAG CATGTAAGAC GACCGTTTAA TTCAGGCATT CCGAAGGCAT GAGCGCATGG 
C CTCCTCGTC GTACATTCTG CTGGCAA ATT AAGTCCGTAA GGCTTCCGTA CTCGCGTACC 

4501 ATTCTGTCAC CAAGCGTATA AAAGGACCCT GGCATTGGGA AACCTATGAC^GGACTGT^ 
TAAGACAGTG GTTCGCATAT TTTOCTGGG A CO^AAOyT OCTG ACAAAA 

4561 TGCTGTAGAA GTAGGGATTT TACAGAAGTC "tCCTTGGATT TGCCCTGCCT GGGGCAGTTT 
ACGACAT C TT CATCCCTAAA ATGTCT TCAG AGGgACCTAA ACGGGACGGA CCCCGTCAAA 

4621 TGCAGAGGAA CCTGCCAGAG ATTTATTCGC TGGTCAGTCT CTTGTGAAAT ACTAOXaiiGT 
AOGTCTCCCT GGA CGGTCTCJ ^^ ACCAGTCAGA Si^S^iP??^^. TCATAGTACA 

4681 GAGAAACAGT TTGTAGAAAA AAACTATACC TOTC^ TTTGCAACAT TCTTCXnTCC^ 

CTCTTTGTCA AACATCTTTT TTTGATATGG ACCCTTCTGG AAACGTTGTA ACAAGGAAGG 

4741 ATGGGCCAAG ACTCAGTTAG GAGGCATAAA TCTGC0CX3GA ATAAACTAGG CCAGGATACA 
TACOCGGTTC T<aVGTCAATC CTCCGTATTT A GAOGGGOCT TATTTG ATCC GGTOCTATGT 

4801 GCCATGTTTA GTTAATAATT TGGTTTTAGA ATTCACACAG GCAGGATTGG TTTTTTTGTG 
CGGTACAAAT CAATTATTAA ACCAAAATCT TAAGTGTGTC CGTCCTAACX: AAAAAAACAC 

4861 TCTTGGCAAG TGGAGCATAT TTAACATAGA GGCATGGGAA TOCTGCCTCT TAGCTTTTCC 

A GAACCGTTC ACCTCGTATA AATTG TATGT CCGTACOCTT AGGA«SGAGA A TCGAAAAGG 

"4921 " CACCCTCTTG TCTCACCAAG TTTTTTCTCT O^AAGGTTT OCAGGAATTT CTCATTAATG 
GTGGGAGAAC AGAGTGGOTC AAAAAAGAGA GGTTTOCAAA GGTOCTTAA A GAGTA ATTAC 

4981 GCTGATGCAA ACTTAGTGAA TAATAATGAA TATAAACAAT GCTCACCTCA OCAAAATTAT 
CGACTACGTT TGAATCACTT ATTATTACTT ATATTTGTTA OGAGTGGAGT GGTTTTAAT A 

5041 ATTATTTGCA GTCATTTGTG ATAAGAGAAA TTTTATOGCA ATGGTTATTA TTTAATTTGT 
TAATAAACGT CAGTAAACAC TATTGTG TTT AAAATAGCGT TACCAATAAT AAATTAAACA 

5101 GGCCACACAC TGTGGTTATC TTTTGTTGTG GTTGTTTCTG AGAAAATGTT CTTGGATATG 
C CGGTGtGTG ACACCAATAG AAAACA ACAC CAACAA^^ 5??????^?^. GAAC CTAT AC 

5161 TAAGTGCCAA TACCAGTGTG AAGTATTGAT aX^GGGoic^ AAAATACAGC CTAAGGTTTG 

A TTCACGGTT ATG G TCACAC TTCATAA ^YA G GGCC CGTCG J^][??ATGTCG_ GATTCCAAAC 

" siizi TAAACATCAA TTCTATCTCA GTTCATCAGa" TCGCCTGAGA AGCTCCGGG6 CAGTGTAAAG 
ATTTGTAGTT AAGA T AGAGT CA AGTAGTCT CCCGGACTCT TCGACGCCCC GTCACATTTC 

sizei TAAAGTATGC TGGGCTGGTG GtGGTCAGCC TCCCCTTGCC AAGAAGAGAG CAATTGAATC 
_ ATTTC ATACG ACCCGACCAC CACCAG TCGG AGGTOAACGG TTC 

5341 CTGTCCCCAg" CTCCCTCXyVC GCCTGAAGAG TGACCAGTGC TGGCCCGACG GATCGCTGAG 
^^GACAGGGGTC GAGGGAGGTG CGG ACTTCTC ACTGGTCACG ACCGGGCTGC CTAGCGACTC 

5<0l ATATTCTCCCnATAATGGCAA AAAAATAGGC AGTTTGATGT GACCTGTTTA CTGTGGCTCT 
TATAAGAG6G TA TTACCGTT TTTTTATCCG TCAAACTACA CTGGACAAAT CAjCACCGAGA 

5461 CCTCTTTTGA GCATGTGTTA GCATTTTTAT* TTTATACTCA TCCAGTGAAC TCTGCTCTTC 

GG AGAAAACT CGTACACAAT CGTA AAAATA AAATATGAGT AGGTCACTTG AGACGAGAAG 
5521 CAAGTGTGTT CATGTATGTG CTAGATATAT TAGCACAGCC TGCCTTCTGC TGCACAACGC

_giTCA9^g^_ GTACATACAC GATCTATATA ATCGTGTCGG ACGGAAGACG ACGTGTTGCG 

5581 CTTAGAGACC CGGCCTTtCA ATGAGCTTAG CTTGTGCTCT GTTTCTGCTC TCTTAGGTCT 
GAATCTCTGG CAAAGACGAG AGAATCCAGA 

5641 AAACTATGGT GTCAGTTTTA ATAGAACAAA AGTATGCATC TTGCCTTGGC TTGAGCCTTT 

JJII?.^I^5ir^. .9^1?T?/^:f^^^. tatcttgttt tcatacgtag aacggaaccg aactcggaaa 

5701 TCGTTTTCAA TGCTGACTTC TCCCCTTTCT CTCCTGTGCT CACCTTACCT TT.CCAGAGTG 

j^gg^ft^GTI ■ j^gfl^9?Q^Q AGGGGAAAGA GAG6ACACGA GTGGAATGGA AAGGTCTCAC 

5761 TAAGGGACAA CTTTTAAGGA GGCGTGTCCC TGGTAGGGGC ATCCCTGTTC ACCAGGTGCC 

A TTCCCTGTT GAAAATTCCT CCGCAC AGGG ACCATCCCCG TAGGGACAAG TGGTCCACGG 

5821 T6TCATCACC CCACTTGACT GACaVTCTACC CTGGTGACTA TGGGTTCCTC TTCTTTGTAG 

ACAGTAGTGG GGTGAACTGA CTGTAGATGfG GACCACT GAT ACCCAAGGAG AACAAACATC 

5881 GGAACGGTGG CTCCAGGTGG AGGCATCAAT CTGTTGGGTT CTGGTTCCCG GCTG^ 

OCTTGC CACC GAGGTCCACC Ta:GTA GTTA GACAAOCCAA GACCAAGGGC CGACGGAAAC 

5941 GTTTTGAAAG TCTCTTCTCT GTATATTCCT ACCCTGCATT TGCTTTGTGT GGTGC*GATG 

CAAAA CTTTC A GAGAAGAGA CATAT AAGGA TGGGACGTAA ACGAAACACA CCACGACTAC 

6001 CTGTGGO^T AGGATCTTGG ATGACTCTCC ATCAGTCACA GACT<XCCCT QTTQcKiii^T 

GACACCGTCA TCCTAGAACC TACTGAG A GG TAGTCAGTGT CTGAGGGGGA CAACGTTTCA 

6061 GTCAGGCT6A CTCGACAGTC ACCGTAAAAT CTGAGTCAGT CACACACAGg" CTGTCAGCCA 

cagtocgact gagctgtcag tggcatttta gactcagtca gtgtg tgtcc gacagtcggt 

6121 CGGCTTCCAC TTGCATGGCT ATTCTATTTT CACACGTGAG l^TTCTGTTGC TGGCTGGCTG 

gccgaa ggtg aacgtaccga taag ataaaa gtgtgcactc aaagacaacg accgacx:gac 

6181 acrisgcatta tctatgctaa gttgaaatca ggagtctgcc cagcagagcc catcatoctc 

TGACXGTAAT AGATACGATT CAACTT TAGT CCTCACACGG GTCGTCTCGG GTAGTAAGAG 

6241 ACTGTCTTTG AAACAAAGCT GTACGGTTTG ATCGATGAAC GTAtTTAAAG CATTTCATGC 

^TGACAGAAAC TTTGTTTC6A CATGCCAAAC TAGCTACTTG CATAAATTTC GTAAAGTACG 

6301 AATGACAAAG TGCTCAGTAG TGGAAGGCAG GCTCTGACCA GTCTGCCTGC TCC^TACTAT 

T TACTGTTTC ACGAGTCATC ACCTTC CGTC CG ACACTGG T CAGACGGACG AGGAATGATA 

6361 AATTGTGAGG ATTTGTTACT 6GAACAGTAC ATGGAGGOCT GACCTTGTGG t^CACAGGG 

TTAA CACTCC TAA A CAATGA CCTTGT CATG TACCTCCGGA CTGGAACACC OCCGTGTCCC 

6421 TGGAACCTTA GCTGAATATA GTGTGTGTCT CAAGAGGAAG TCAGGGTACT K^XTCAGTGC 

ACCTTGGAAT CGACTTATAT CACACACAGA GTTCTCCTT C AGT00CAT6A TCGAGTCACG 

6481 TCAATCTCCA GGTACTATAT ATACATTTGC CCGTTTTATC TCTAATGTGA AATAAATCCC 

AGTTAGAGGT CCATGATATA TATGTA A ACG GGCA AAA>TAG AGATTACACT TTAXTTAGGG 

6541 GAAACACTTG TTTATCGT6T AGCGTACCTA AAAGACTATT CTATTATGGG lOTCCCCACT 

GTTTGTGAAC AAATA GCACA TCGCATGGAT TTTCTGATAA GATAATACCC ACAGGGGTGA 
6601 TTCTTGGTTT GGTCACCCCG ATCCCCCGGT CTTCTGCTGT ATCTAGAACA GTGACTATAA 

MGAACCATUV CCAGTGGGGC JAGGGGGCCA GAAGACGACA TAGATCTTGT CACT6ATATT 
6661 ATGATGTATG GGAATAGTGT TTCCATATGA TCTGTTGTCT GGAGTATATG CTACATGTTC 

TACTA CATAC CCTTATCACA AAGGTATACT AGACAACAGA CCTCATATAC GATGTACAAG 

6721 ATTTACTGTA CAAAAACCCA GTGCAGCTGA TGATGCAAAG CA6TCTCTCT CTGTGTACAG 

TAAATGACAT GT TTTTG GGT CACGTCGACT ACTACGTTTC GTCAGAGAGA GACACATGTC 

6781 TGCCCCACCT ATTTAAAAAT CACGTACAAN CCCAGAACAC TGTGAAACAC TTAACATAAG 

ACGGGGTGGA TAAATTTTTA GTGCATG T TN GGGTC TTGTG ACACTTTGTG AATTGTATTC 

6841 AAACAAACGC AGCGTCTGGA TTCTTTCCAA GGAGAGCAGC TTTCTCCACA GGAACACAGT 
TTTGTTTGCG TCGCAGACCT AAGA AAGGTT CCTCTCGTCG AAAGAGGTGT CCTTGTGTCA 
6901 AACAAAAGAG GTCCGCCGCC ATCCACACCC AGCCAAGACA CCTCAGAGGC CATAGGGACA

TTGTTTTCTC CA GGCGGCG G TAGGTGTGGG TCGGTTCTGT GGAGTCTCCG GTATCCCTGT 

6961 ACCTCCTTGC TGGCCAACAC CTGCTGGAGC AGGGCACAGG TCCCAGCAAC TGATCCTCAG 

TGGAGGAACG ACCGGTTGTG GACGACCT CG TCCCGTGTCC AGGGTCGTTG ACTAGGAGTC 

7021 TGGATGGGTC CGCAGTCAAA GCCTTAATGG GCTCTCTTTT GAAGGGGAftA GAAANNTTTC 
ACCTACCCAG GC^^ C^GAATTACC CGAGAGAAAA CTTCCCCTTT CTTTNNAAAG 

7081 AAGCTTATGA TATCCAACAT TATTATAGTT GATGAGTTAG TAAATTCCGA AAAAAAAAGA 
TTCGAATACT ATA^^ ATAATATCA^, CTACTCAATC ATTTAAGGCT TTTTTTTTCT 

7141 TGATTTTATA TGTATGACAT AAAAAAAATC TTTGTAAAGT GCGCAAGTGC AATAATTTAA 

,^9yj^fiftft^^?_ j:9^.Ty.T^^^ TTTTTTTTAG AAACATTTCA CGCGTTCACG TTATTAAATT 

7201 AGAGGTCTTA TCTTTGCATT TATAAATTAT AAATATTGTA CATGTGTGTA ATTTTTCATG 

TCTCCAGAAT AGAAACGTAA ATATTTAA TA TTTATAACAT GTACACACAT TAAAAAGTAC 

7261 TATTCATTTG CAGTCTTTGT ATTTAAAAAA ACTTTACTGT TATGTTTCT^ TAATAGAACA 

ATAAGTAAA C GTCAGAAACA TAAATT TTTT TGAAATGACA ATACAAACAT ATTATCTTGT 

7321 TTAATCATTT ATTATAACTC AGACAATCTC TAAATAAATT CATAATTCAA ACAGCCAGTA 

AATTA GTAA A TAATATTGAG TCTGTTCCAC ATTTATTTAA GTATTAAGTT TGTCGGTCAT 

7381 TATATGCATA TATGGGTGTT ACATTGCAAA AATCTCTAtC TtTGTTCTAT TCACATGCTT 

ATATACGTAT ATACCCACAA TGTAA CGTTT TTAGAGATAG AAACAAGATA AGTGTACGAA 

7441 AAAGAAGTAA GAAATCTTTT 6TGGATATGT AATTATACAT ATAAAGTATa" TATATATGTA 

TTTCTTCATT CTTTAGAAAA CACCTATACA T TA ATATGTA T ATTT CATAT ATATATACAT 

7501 TGATACATGA AATATATTTA GAAATGTTCA TAATTTTAAT GGATATTCTT TGGTGTGAAT 

ACTATGTACT TTATATAAAT CTTTACAAG T ATTAAAATTA CCTATAAGAA ACCACACTTA 

7561 AATTGAATAC AACATTTTTA AAATGAAAAA AAAAAAAAAA AAAAAAAAAA 
^ TTAACTTATG TTGTAAAAAT TTTACTTTTT XTTTTTTTTT tTTTTTTTTT tTTTTTTT 
<130> 36541-0005

<140> 
<141> 

<150> US60/158,043 
<151> 1999-10-07 

<150> US60/217,765 
<151> 2000-07-12 

<160> 12 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 3236 
<212> DNA 
<213> mouse 

<400> 1 

cactgaagct ggtccacgtc tataaacagg tgacactggc tgcagcaaaa agccattcga 60 
tccacacaaa ttgatcttct atcatcttgg aatctgaatt gcagggagga gcagtatgta 120 
agacgaccgt ttaattcagg cattccgaag gcatgagcgc atggattctg tcaccaagcg 180 
tataaaagga ccctggcatt gggaaaccta tgacggactg tttttgctgt agaagtaggg 240 
attttacaga agtctccttg aatttgccct gcctggggca gttttgcaga ggaacctgcc 300
agagatttat tggctggtca gtctcttgtg aaatagtatc atgtgagaaa cagtttgtag 360 
aaaaaaacta tacctgggaa gacctttgca acattgttcc ttccatgggc caagactcag 420
ttaggaggca taaatctgcc cggaataaac taggccagga tacagccatg tttagttaat 480 
aatttggttt tagaattcac acaggcagga ttggtttttt tgtgtcttgg caagtggagc 540
atatttaaca tacaggcatg ggaatcctgc ctcttagctt ttcccaccct cttgtctcac 600 
caagtttttt ctctccaaag gtttccagga atttctcatt aatggctgat gcaaacttag 660 
tgaataataa tgaatataaa caatgctcac ctcaccaaaa ttatattatt tgcagtcatt 720 
tgtgataaca caaattttat cgcaatggtt attatttaat ttgtggccac acactgtggt 780 
tatcttttgt tgtggttgtt tctgagaaaa tgttcttgga tatgtaagtg ccaataccag 840 
tgtgaagtat tgatcccggg cagcaaaata cagcctaagg tttgtaaaca tcaattctat 900 
ctcagttcat cagagggcct gagaagctgc ggggcagtgt aaagtaaagt atgctgggct 960 
ggtggtggtc agcctcccgc ctgaagagtg accagtgctg gcccgacgga tcgctgagat 1020 
attctcccat aatggcaaaa aaataggcag tttgatgtga cctgtttagt gtggctctcc 1080 
tcttttgagc atgtgttagc atttttattt tatactcatc cagtgaactc tgctcttcca 1140 
agtgtgttca tgtatgtgct agatatatta gcacagcctg ccttctgctg cacaacgcct 1200 
tagagacccg gcctttcaat gagcttagct tgtgctctgt ttctgctctc ttaggtctaa 1260 
actatggtgt cagttttaat agaacaaaag tatgcatctt gccttggctt gagccttttc 1320 
gttttcaatg ctgacttctc ccctttctct cctgtgctca ccttaccttt ccagagtgta 1380 
agggacaact tttaaggagg cgtgtccctg gtaggggcat ccctgttcac caggtgcctg 1440 
tcatcacccc acttgactga catctaccct ggtgactatg ggttcctctt gtttgtaggg 1500 
aacggtggct ccaggtggag gcatcaatct gttgggttct ggttcccggc tgcctttggt 1560 
tttgaaagtc tcttctctgt atattcctac cctgcatttg ctttgtgtgg tgctgatgct 1620 
gtgcgcagta ggattcttgg atgactctcc atcagtcaca gactccccct gttgcaaagt 1680 
gtcaggctga ctcgacagtc accgtaaaat ctgagtcagt cacacacagg ctgtcagcca 1740 
cggcttccac ttgcatggct attctatttt cacacgtgag tttctgttgc tggctggctg 1800 
actggcatta tctatgctaa gttgaaatca ggagtgccca gcagagccca tcattctcac 1860 
tgtctttgaa acaaagctgt acggtttgat cgatgaacgt atttaaagca tttcatgcaa 1920 
tgacaaagtg ctcagtagtg gaaggcaggc tgtgaccagt ctgcctgctc cttactataa 1980 
ttgtgaggat ttgttactgg aacagtacat ggaggcctga ccttgtgggg gcacagggtg 2040
gaaccttagc tgaatatagt gtgtgtctca agaggaagtc agggtactag ctcagtgctc 2100 
aatctccagg tactatatat acatttgccc gttttatctc taatgtgaaa taaatcccca 2160 
aacacttgtt tatcgtgtag cgtacctaaa agactattct attatgggtg tccccacttt 2220
cttggtttgg tcaccccgat cccccggtct tctgctgtat ctagaacagt gactataaat 2280 
gatgtatggg aatagtgttt ccatatgatc tgttgtctgg agtatatgct acatgttcaa 2340 
ttactgtaca aaaacccagt gcagctgatg atgcaaagca gtctctctct gtgtacagtg 2400 
ccccacctat ttaaaaatca cgtacaascc cagaacactg tgaaacactt aacataagaa 2460 
caaacgcagc gtctggattc tttccaagga gagcagcttt ctccacagga acacagtaac 2520 
aaaagaggtc cgccgccatc cacacccagc caagacacct cagaggccat agggacaacc 2580 
tccttgctgg ccaacacctg ctggagcagg ggcacaggtc ccagcaactg atcctcagtg 2640 
gatgggtccg cagtcaaagc cttaatgggc tctcttttga aggggaaaga aagaatttca 2700 
agcttatgat atccaacatt attatagttg atgagttagt aaattccaaa aaaaaaagat 2760 
gattttatat gtatgacata aaaaaaatct ttgtaaagtg cgcaagtgca ataatttaaa 2820 
gaggtcttat ctttgcattt ataaattata aatattgtac atgtgtgtaa tttttcatgt 2880
attcatttgc agtctttgta tttaaaaaaa ctttactgtt atgtttgtat aatagaacat 2940
taatcattta ttataactca gacaaggtgt aaataaattc ataattcaaa cagccagtat 3000
atatgcatat atgggtgtta cattgcaaaa atctctatct ttgttctatt cacatgctta 3060
aagaagtaag aaatcttttg tggatatgta attatacata taaagtatat atatatgtat 3120
gatacatgaa atatatttag aaatgttcat aattttaatg gatattcttt ggtgtgaata 3180
attgaataca acatttttaa aatgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3236

<210> 2 
<211> 5752 
<212> DNA 
<213> mouse 

<400> 2 

aagtgtaaat aaaataaaca tctaataaaa aaaattacat accatagagg aacaagataa 60 
tttctgccca acttcatacc ctccagcgta tagtgttgag gtttggtctg ttgctgtgta 120 
ttgtaatgta atgttaaatt ctctacctga aggtctaggc ctacaagtga attctcatgt 180 
ttatagagtt ttgttgtgca aaccttgttc cttaatttaa aactatggtt aaaaaacaaa 240 
acaaaactgg ctacagccaa taactgaagg gggttacctt gttgaagggg tggaaaagag 300 
agaggaggaa gaagggagtt caagagaagg agaagaacaa gaggagagga ggaagctgcc 360 
acgaggggag atgggccatg agaacttggc caggagaaat agccagtatc tggagtacac 420 
cactgaggag gtagccaggc tagcagttag aagagtagat taggggttat ttttccccca 480 
ctccacatag ttatcaaagc caaataaaat aaccatagtc tgagtctcat ctatttgtaa 540 
gctagttggg tataagatta atttggctgt actacagttt agatttctaa cataggaact 600 
atcaaaaact tgctcaaaca agaacatgct gacaatattt taaaatgatt atttatattg 660 
tttgcacttt ctaaagtttc ttctaaatgt tccatggtca aattaaaaaa tatacatatt 720 
ggctattaaa ttcgtctaag tggggctgga gagatagctc agaggttaag agcactgact 780 
gctcttccag aggtcctgag ttcaattccc agcgaccaca tggtggctca cagccatctg 840 
taatagatag gatctgacgc cctcttctgg agtgtctgaa gacagctaca atgtactcat 900 
atatattaaa taaataatat tagaaaattc ttctaagtgt atcatttata gaatatttaa 960 
tatataaagt aaatgcctca ggaaatataa acttggaatt aaatcaaaga acttcatgag 1020 
tagtgggcca caaaaaatgt gtaccagggg aagaccggag ggaggggaga aggaagggat 1080 
ggagatagaa ttttgcctct gcattccttg ggctggcaca ggtataatgc tgtgggaatt 1140 
gggaaactac aaggaagctg caaagctggg cggaactcgt ttccgcaagc tgggctcatc 1200 
taagtgtcca tgcatggctg ccacactgca gtgaacttta aaacatttgt gttccagaga 1260 
tgtagagatg ctcacaatag tacaaaggcg ggagggaggt atttccagac taagaggaag 1320 
aaaaaccatt gctgattaaa catctgcata tgagcgcccc cacctccata cacacacaca 1380 
cacacacaca cacacacaca caaccaaaca gaacaaatac acatgcatgt ctacagcctg 1440 
caggaacaaa atggtatgtc tgtgaggaac caggagatgc acaggtccta acctctgtct 1500 
cctacaagcc ctgaagtctg gtcagggtca aatgtacaaa agcaggctaa ggaagctgtt 1560
tagtgaaaga tttttttctt caactctagg aacaacctat ttcctaggat ttggagagtg 1620 
ctcaggagga aacattcaga caactgatgc tctctgtgta ccccagattc aggtattggg 1680 
gtagttagtt gtgctcatgt atgtgctaga tatattagca cagcctgcct tctgctgcac 1740
aacgccttag agacccggcc tttcaatgag cttagcttgt gctctgtttc tgctctctta 1800 
ggtctaaact atggtgtcag ttttaataga acaaaagtat gcatcttgcc ttggcttgag 1860 
ccttttcgtt ttcaatgctg acttctcccc tttctctcct gtgctcacct tacctttcca 1920 
gagtgtaagg gacaactttt aaggaggcgt gtccctggta ggggcatccc tgttcaccag 1980 
gtgcctgtca tcaccccact tgactgacat ctaccctggt gactatgggt tcctcttgtt 2040 
tgtagggaac ggtggctcca ggtggaggca tcaatctgtt gggttctggt tcccggctgc 2100 
ctttggtttt gaaagtctct tctctgtata ttcctaccct gcatttgctt tgtgtggtgc 2160 
tgatgctgtg cgcagcagga ttcttggatg actctccatc agtcacagac tccccctgtt 2220 
gcaaagtgtc aggctgactc gacagtcacc gtaaaatctg agtcagtcac acacaggctg 2280 
tcagccacgg cttccacttg catggctatt ctattttcac acgtgagttt ctgttgctgg 2340 
ctggctgact ggcattatct atgctaagtt gaaatcaggg gtgcccagca gagcccatca 2400 
ttctcactgt ctttgaaaca aagctgtacg gtttgatcga tgaacgtatt taaagcattt 2460 
catgcaatga caaagtgctc agtagtggaa ggcaggctgt gaccagtctg cctgctcctt 2520 
actataattg tgaggatttg ttactggaac agtacatgga ggcctgacct tgtgggggca 2580 
cagggtggaa ccttagctga atatagtgtg tgtctcaaga ggaagtcagg gtactagctc 2640 
agtgctcaat ctccaggtac tatatataca tttgcccgtt ttatctctaa tgtgaaataa 2700 
atccccaaac acttgtttat cgtgtagcgt acctaaaaga ctattctatt atgggtgtcc 2760 
ccactttctt ggtttggtca ccccgatccc ccggtcttct gctgtatcta gaacagtgac 2820 
tataaatgat gtatgggaat agtgtttcca tatgatctgt tgtctggagt atatgctaca 2880 
tgttcattta ctgtacaaaa acccagtgca gctgatgatg caaagcagtc tctctctgtg 2940
tacagtgccc cacctattta aaaatcacgt acttgcccag aacactgtga aacacttaac 3000
ataagaacaa acgcagcgtc tggattcttt ccaaggagag cagctttctc cacaggaaca 3060 
cagtaacaaa agaggtccgc cgccatccac acccagccaa gacacctcag aggccatagg 3120 
gacaacctcc ttgctggcca acacctgctg gagcaggggc acaggtccca gcaactgatc 3180
ctcagtggat gggtctgcag ccaaagcctt aatgggctct cttttgaagg ggaaagaaag 3240 
aatttcaagc ttatgatatc caatattatt atagttgatg agttagtaaa ttccaaaaaa 3300 
aaaagatgat tttatatgta tgacataaaa aaaatctttg taaagtgcgc aagtgcaata 3360 
atttaaagag gtcttatctt tgcatttata aattataaat attgtacatg tgtgtaattt 3420 
ttcatgtatt catttgcagt ctttgtattt aaaaaaactt tactgttatg tttgtataat 3480 
agaacattaa tcatttatta taactcagac aaggtgtaaa taaattcata attcaaacag 3540 
ccagtatata tgcatatatg ggtgttacat tgcaaaaatc tctatctttg ttctattcac 3600 
atgcttaaag aagtaagaaa tcttttgtgg atatgtaatt atacatataa agtatatata 3660 
tatgtatgat acatgaaata tatttagaaa tgttcataat tttaatggat attctttggt 3720 
gtgaataatt gaatacaaca tttttaaaat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3780 
aaaatttttt tttttttttt ttattccaga gattaaagac actagatctt taaccttgaa 3840 
gggcaggcaa gaggtcggca atgctgtcaa catagaagtc agggaccatt ttcttcttga 3900
acatgcagtc actttcctga ttgctcttca catcctcaag gctccggaat tccgggggtg 3960 
tggtgggctt tgatctcagg actctggagg cagaagcagg cagatctctg tgaatatgag 4020 
gccagcctgc actacacaga gctccagacc agtcatggct acatcatgaa accctgtctc 4080 
aaaaagaaaa taaaaactgt tgtgtttcta ccatagtgtt aaactcagag tctgagtaat 4140 
gtcgggctga catgctcggg tgtttaacat accttcagct ttgacgaggc gctgaacagt 4200 
caaagtctgg ccttggggag cggtggctgt gtttgtgctc aagtccaccg tgaaatcctg 4260 
attgtgaatt tggacaaccg tgtccttctt cttggccttc catgcaacct ccaacttcat 4320 
gttggtcatt ttgtcaaaac actgtgtgat gtttttatca atatactgcc attccacata 4380 
tgtagagatg tagtctgcct ggctttcctt ttctttagcc aatcgaatgc tcttgatcat 4440 
gccctcaatc tcatctctag cttttatcac gtctctgcta attcctgaaa cttgaatcga 4500 
agttttcttc tggttcatct caatggtgat gttcagttcc ttctgaatct cattcagttt 4560 
ctcgtactcc tccatgtcaa agtcactgac acactcatcg tcattggtgt aggaaagctg 4620 
ctctttggta atcagttcct ttagccagga gattgttttg ttcacactgt ctacccctga 4680 
aocacatacc tggaaaactg tgtgctctat tttcttttcc aaaaccaggg tgttcttttt 4740 
gggggaagct tgcttgggaa agccaagaaa ggctaaagag aaaatggaaa ttaatgtttc 4800 
ttttactccc ttcaacatca aggttaggaa tatgtatttc ataaaagcta acaactcaca 4860 
ggcaatctta gacatcactg actgcttggc aggcgactgc ttggggggag ctggagagcc 4920 
ttctctttct ttcatgttgt cgtaaaaaaa ttgcagaata tggggctgga agataacaac 4980 
tttaactctc ttcacagcct gcactgattt tttctggaca aattcttcaa tggcatctat 5040 
tatcgctttt gctactacgt ttgggtcctg ttgagcattt ccttcaaaaa caaaaaaagc 5100 
acatttttaa aaagtcaagg ttaagatcca cctgcaaaaa aaagctgcaa tataagcgag 5160 
gaattctagt tgtcacagga aataaaaatg tctgttccca ctataatcaa tgtagactga 5220 
taatattatg ccagcaaata gttttgaagt cctaggcaca gtgggaggag gttttgttcc 5280 
acgctgttca taagccaata ccccagcaaa agaccttaaa ggacaacttg taatttggga 5340 
cattcacatc tgtcctcttc atctgatctg gctcccagtg tcactctcta acacggtcct 5400 
tagagggaca atttatccct gcctctgctt gatcttatgc atgtatctgt attcttccag 5460 
ccatccctgg cgacctgatt tttctaaggc acccaaaact gtaagctact tcttataatc 5520 
tataattctg agcatattag ttagcctgag cctccaggat atctttcttc cctatactca 5580 
gtccagtttt agctgcccag aaggattcaa agctgatcta cgagtagatc actcctgtct 5640 
acagcttgtt ccagatcttg tttctcaagc cctggaagcc atcagccagg taagattgta 5700 
aaacaatccc tttctaatca tgggtgtggc ccaaagtgaa tggccggaat tc 5752 

<210> 3 
<211> 475 
<212> DNA 
<213> mouse 

<400> 3 
tgtatgggaa tagtgtttcc atatgatctg ttgtctggag tatatgctac atgttcattt 60
actgtacaaa aacccagtgc agctgatgat gcaaagcagt ctctctctgt gtacagtgcc 120 
ccacctattt aaaaatcacg tacttgccca gaacactgtg aaacacttaa cataagaaca 180 
aacgcagcgt ctggattctt tccaaggaga gcagctttct ccacaggaac acagtaacaa 240 
aagaggtccg ccgccatcca cacccagcca agacacctca gaggccatag ggacaacctc 300 
cttgctggcc aacacctgct ggagcagggg cacaggtccc agcaactgat cctcagtgga 360 
tgggtctgca gccaaagcct taatgggctc tcttttgaag gggaaagaaa gaatttcaag 420 
cttatgatat ccaatattat tatagttgat gagttagtaa attccaaaaa aaaaa 475 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 4 

agggctgtca atcatgctgg 20 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

aaactcacgg tcggtgcagc 20 

<210> 6 
<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: probe 
<400> 6

attaaccctc actaaatgct gtat 24

<210> 7 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : probe 
<400> 7 

cattatgctg agtgatatct ttttttttcg 30

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence : probe 
<400> 8 

gaacatgtag catatactcc agacaacaga tcatatgg 38

<210> 9 
<211> 32 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: probe 
<400> 9 

cagcttctcc acaggaacac agtaacaaag ag 32

<210> 10 
<211> 35 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

ctatttcaca agagactgac cagccaataa atctc 35 

<210> 11 
<211> 7581 
<212> DNA 
<213> Unknown 

<400> 11 

cgcccgggca ggtctgttgg agggcagttg gtcaacctga ccagagagag ctgagctgga 60 
agaccccact gatggtgtgc tgcctttcag tccaggaaga aagaaaggaa ggattctgag 120 
gatttgggca aagccacatt cctggagaag tctgtatact gatgccaaac ccaagagctg 180 
agctgctgat gaggcccagg gagtagccca cgcgccctga gctgttggct agcaaggcct 240 
tcctgctcca tgtggcatgg aaaaattata tggtttgacg gatgaaaagg tgaaggccta 300
tctttctctc catccccagg tattagatga atttgtttct gaaagtgtta gtgcagagac 360 
tgtggaaaag tggctgaaga ggaaaaccaa caaagcaaaa gatgaaccat ctcccaagga 420 
agtcagcagg taccaggata cgaatatgca gggagtcgtg tacgagctga acagctacat 480
agagcagcgc ctggacacgg gcggggacaa ccacctgctc ctctatgagc tcagcagcat 540
catcaggata gccacaaaag ccgacggatt tgcactgtac ttccttggag agtgcaataa 600 
tagcctgtgt gtgttcatac cacccgggat gaaggaaggc caaccccggc tcatccctgc 660 
agggcccatc acccagggta ccaccatctc tgcctacgtg gccaagtcta ggaagacgtt 720 
gttggtagag gatatccttg gggatgagcg atttcctcga ggtactggcc tggaatcagg 780 
aacccgcatc cagtctgttc tttgcttgcc cattgtcact gccattggag acttgattgg 840 
catccttgaa ctgtacaggc actggggcaa agaggccttc tgcctcagcc atcaggaggt 900 
tgcaacagcc aatcttgctt gggcttccgt agcaatacac caggtgcagg tgtgtagagg 960 
tctcgccaaa cagaccgaac tgaatgactt cctactcgac gtatcaaaga catactttga 1020 
taacatagtt gccatagact ctctacttga acacatcatg atatatgcaa aaaatctagt 1080 
gaacgccgac cgctgcgcgc tcttccaggt ggaccacaag aacaaggagc tgtactcgga 1140 
cctgtttgac attggggagg agaaggaggg gaagcccatc ttcaagaaga ccaaggagat 1200 
cagattttcc attgagaaag ggattgctgg tcaagtggca agaacaggcg aagtcttgaa 1260 
cattcccgat gcctacgcgg accctcgctt taacagggag gtggacctgt acacaggcta 1320 
caccacgagg aacattctgt gtatgcccat agtgagccga ggcagcgtga ttggcgtggt 1380
gcagatggtg aacaagatca gcggtagcgc cttctccaag acagacgaga acaacttcaa 1440
gatgtttgct gtcttctgcg cactggcctt gcactgtgct aacatgtacc acaggatccg 1500 
ccactcagaa tgcatctaca gggttaccat ggagaagctt tcctaccaca gcatctgcac 1560
ctccgaggag tggcaaggcc tcatgcgctt caacctacca gcacgcatct gccgggacat 1620 
cgagctattc cactttgaca ttggtccttt cgagaacatg tggcctggga tctttgtcta 1680 
catgatccat cggtcttgtg ggacatcctg ttttgaactt gaaaaattgt gccgttttat 1740 
catgtctgtg aagaagaact atcggcgggt tccttaccac aactggaagc atgcagtcac 1800 
ggtggcacac tgcatgtatg ccatacttca aaacaacaat ggcctcttca cagacctcga 1860 
gcgcaaaggc ctgctaattg cgtgtctgtg ccatgacctg gaccacaggg gcttcagtaa 1920 
cagctacctg cagaagttcg accaccccct ggcggcgctg tactccacct ccaccatgga 1980 
gcaacaccac ttctcccaga cggtgtccat ccttcagctg gaagggcaca atatcttctc 2040 
caccctgagc tccagcgagt acgagcaggt gctggagatc atccgcaaag ccatcatcgc 2100 
caccgacctc gccctatact ttgggaacag gaagcagttg gaggagatgt accagacagg 2160 
gtcgctgaac ctccacaacc agtcccatcg agaccgtgtc atcggcttga tgatgactgc 2220 
ctgtgatctt tgctctgtga ccaaactatg gccagttaca aaattgacag cgaatgatat 2280 
atatgcagaa ttctgggctg agggtgatga gatgaagaag ctgggcatac agcccattcc 2340 
tatgatggac agagacaagc gagatgaagt ccctcaaggg cagctcggat tctacaatgc 2400 
tgtggccatt ccctgctata ccaccttgac gcagatcctc ccacccacag agcctctgct 2460 
gaaggcctgc agggataacc tcaatcagtg ggagaaggta attcgcgggg aagagacagc 2520 
aatgtggatt tcaggcccag gcccggcgcc tagcaagagc acacctgaga agctgaacgt 2580 
gaaggttgaa gactgatcct gaagtgacgt cctgatgtct gcccagcaac cgactcaacc 2640 
tgcttctgtg acttcgttct ttttgttttc aaggggtgaa aaccccctgt cagaaggtac 2700 
cgtcgcatat ccatgtgaag cagacgactc cctgcttgcc gcacacacct cggacagtga 2760 
gcaacccagg ctctgccgtg ttcagacgtc ggctactccg tggctccacc tgacctccga 2820
atgctatttg ctcccaggcc agcactgcac tgtctggagg gggcagagac cacaggagag 2880
gttcttgcct gcatcctccc atgagggtgt ggccagttcc ctagttctgt gccatgctgc 2940
tgcttggtgg cattggttag gaatgggaca cacgcccctt gttgtgaagt ttacatgtga 3000 
ccttcttata ggttaactga gtttgtggcc tggacacatg taatgaaggt cacagtccac 3060 
aggtgacaga gaaatccaaa ctgttgatta caggtgcact acaggtatgc tctttcagtc 3120 
tatctggggg cacataggtg agtctgctcc actcagaann aagcatacct ctgccctcat 3180 
ccaggggaca cagggtacat cccaggcatc ggggaactga agctctcact tcaaaccatg 3240 
tcaaagaatt aaaacacctc ccctccccct cactgtagcc ttcgacaact gcgccaatcc 3300 
ctttatacaa agaaaataaa agtaaggcat ataaatttcc tccagcaagc aaatcttgtg 3360 
ggtaaaaaaa aagcatgtga atnntaacaa cntctanant ntcncngnat gttatggcag 3420 
aattttagtc acgtccaaaa caaaaagatt attccagaag atacctcatc ctatgcctga 3480
aaggctccac agcatggcgt ccgtctccca gggttctgat ccgtctcctc acggtgcaat 3540 
caggcaggac agagaggagg gctgcagggc taccacattg acccagaagg tatctcctct 3600 
caccattcag acatccataa ggaatgccaa atgctgtatt gaatagttct ctgtgtgact 3660
ttctagagaa gccaggacac cctgagcctt tccnggggaa ctctaaggag tcacaggttc 3720 
acaccgtggg gattttcagg atagcatgga gacagagatc cggtcgttgt tctcactcgt 3780
gagccttgag aaggagagac tgaccagaaa cactcactca gcactctgca ggagcaggag 3840 
aagatacttt aagatgaatc ttggatagat tttgatacac ccaataccat acacacagga 3900
gcttggcatt tgcaaagtct attcagtttc cttccgcgct ctgacccacg gttgtagcgg 3960 
astgggctga acactgtaac actgtacatg cgatttcccc atgggcttct aaaatgtcac 4020 
catctcctcc cctgctgtgt cctactccat ttactggtta caaggtgatg tcaacaagag 4080 
aagctatcac aacaccaggg ctgtgcacac gtgcacacac atgtatgcac aagcacacag 4140 
atgtatgtac agcacacaca cacacacaca ccccaaaagg agagaaaagg aagaaaacat 4200 
ttataaaaag cgacagctac cccatatcaa aatagtcttt cctgtaggaa acaggagctc 4260 
tccataagga attatcatga gtgtgttctc ccatcagtgc actctcccag gggtgctcac 4320 
tgaagctggt ccacrtctat aaacaggtga cactggctgc agcaaaaagc cattcgatcc 4380 
acacaaattg atcttctatc atcttggaat ctgaattgca gggaggagca gyatgtaaga 4440 
cgaccgttta attcaggcat tccgaaggca tgagcgcatg gattctrtca ccaagcgtat 4500 
aaaaggaccc tggcattggg aaacctatga cggactgttt ttgctgtaga agtagggatt 4560 
ttacagaagt ctccttgrat ttgccctgcc tggggcagtt ttgcagagga acctgccaga 4620 
gatttattgg ctggtcagtc tcttgtgaaa tagtatcatg tgagaaacag tttgtagaaa 4680 
aaaactatac ctgggaagac ctttgcaaca ttgttccttc catgggccaa gactcagtta 4740 
ggaggcataa atctgcccgg aataaactag gccaggatac agccatgttt agttaataat 4800 
ttggttttag aattcacaca ggcaggattg gtttttttgt gtcttggcaa gtggagcata 4860 
tttaacatac aggcatggga atcctgcctc ttagcttttc ccaccctctt gtctcaccaa 4920 
gttttttctc tccaaaggtt tccaggaatt tctcattaat ggctgatgca aacttagtga 4980
ataataatga atataaacaa tgctcacctc accaaaatta tattatttgc agtcatttgt 5040 
gataacacaa attttatcgc aatggttatt atttaatttg tggccacaca ctgtggttat 5100 
cttttgttgt ggttgtttct gagaaaatgt tcttggatat gtaagtgcca ataccagtgt 5160
gaagtattga tcccgggcag caaaatacag cctaaggttt gtaaacatca attctatctc 5220 
agttcatcag agggcctgag aagctgcggg gcagtgtaaa gtaaagtatg ctgggctggt 5280 
ggtggtcagc ctccccttgc caagaagaga gcaattgaat cctgtcccca gctccctcca 5340 
cgcctgaaga gtgaccagtg ctggcccgac ggatcgctga gatattctcc cataatggca 5400 
aaaaaatagg cagtttgatg tgacctgttt agtgtggctc tcctcttttg agcatgtgtt 5460 
agcattttta ttttatactc atccagtgaa ctctgctctt ccaagtgtgt tcatgtatgt 5520 
gctagatata ttagcacagc ctgccttctg ctgcacaacg ccttagagac ccggcctttc 5580 
aatgagctta gcttgtgctc tgtttctgct ctcttaggtc taaactatgg tgtcagtttt 5640 
aatagaacaa aagtatgcat cttgccttgg cttgagcctt ttcgttttca atgctgactt 5700 
ctcccctttc tctcctgtgc tcaccttacc tttccagagt gtaagggaca acttttaagg 5760 
aggcgtgtcc ctggtagggg catccctgtt caccaggtgc ctgtcatcac cccacttgac 5820 
tgacatctac cctggtgact atgggttcct cttgtttgta gggaacggtg gctccaggtg 5880 
gaggcatcaa tctgttgggt tctggttccc ggctgccttt ggttttgaaa gtctcttctc 5940
tgtatattcc taccctgcat ttgctttgtg tggtgctgat gctgtggcag taggatcttg 6000 
gatgactctc catcagtcac agactccccc tgttgcaaag tgtcaggctg actcgacagt 6060 
caccgtaaaa tctgagtcag tcacacacag gctgtcagcc acggcttcca cttgcatggc 6120
tattctattt tcacacgtga gtttctgttg ctggctggct gactggcatt atctatgcta 6180 
agttgaaatc aggagtgtgc ccagcagagc ccatcattct cactgtcttt gaaacaaagc 6240
tgtacggttt gatcgatgaa cgtatttaaa gcatttcatg caatgacaaa gtgctcagta 6300 
gtggaaggca ggctgtgacc agtctgcctg ctccttacta taattgtgag gatttgttac 6360 
tggaacagta catggaggcc tgaccttgtg ggggcacagg gtggaacctt agctgaatat 6420 
agtgtgtgtc tcaagaggaa gtcagggtac tagctcagtg ctcaatctcc aggtactata 6480 
tatacatttg cccgttttat ctctaatgtg aaataaatcc ccaaacactt gtttatcgtg 6540 
tagcgtacct aaaagactat tctattatgg gtgtccccac tttcttggtt tggtcacccc 6600 
gatcccccgg tcttctgctg tatctagaac agtgactata aatgatgtat gggaatagtg 6660 
tttccatatg atctgttgtc tggagtatat gctacatgtt catttactgt acaaaaaccc 6720 
agtgcagctg atgatgcaaa gcagtctctc tctgtgtaca gtgccccacc tatttaaaaa 6780 
tcacgtacaa ncccagaaca ctgtgaaaca cttaacataa gaaacaaacg cagcgtctgg 6840 
attctttcca aggagagcag ctttctccac aggaacacag taacaaaaga ggtccgccgc 6900 
catccacacc cagccaagac acctcagagg ccatagggac aacctccttg ctggccaaca 6960 
cctgctggag cagggcacag gtcccagcaa ctgatcctca gtggatgggt ccgcagtcaa 7020 
agccttaatg ggctctcttt tgaaggggaa agaaannttt caagcttatg atatccaaca 7080 
ttattatagt tgatgagtta gtaaattccg aaaaaaaaag atgattttat atgtatgaca 7140 
taaaaaaaat ctttgtaaag tgcgcaagtg caataattta aagaggtctt atctttgcat 7200 
ttataaatta taaatattgt acatgtgtgt aatttttcat gtattcattt gcagtctttg 7260 
tatttaaaaa aactttactg ttatgtttgt ataatagaac attaatcatt tattataact 7320 
cagacaaggt gtaaataaat tcataattca aacagccagt atatatgcat atatgggtgt 7380 
tacattgcaa aaatctctat ctttgttcta ttcacatgct taaagaagta agaaatcttt 7440
tgtggatatg taattataca tataaagtat atatatatgt atgatacatg aaatatattt 7500 
agaaatgttc ataattttaa tggatattct ttggtgtgaa taattgaata caacattttt 7560 
aaaatgaaaa aaaaaaaaaa c 7581 

<210> 12 
<211> 7618 
<212> DNA 
<213> mouse 

<400> 12 

cgcccgggca ggtctgttgg agggcagttg gtcaacctga ccagagagag ctgagctgga 60 
agaccccact gatggtgtgc tgcctttcag tccaggaaga aagaaaggaa ggattctgag 120 
gatttgggca aagccacatt cctggagaag tctgtatact gatgccaaac ccaagagctg 180 
agctgctgat gaggcccagg gagtagccca cgcgccctga gctgttggct agcaaggcct 240 
tcctgctcca tgtggcatgg aaaaattata tggtttgacg gatgaaaagg tgaaggccta 300 
tctttctctc catccccagg tattagatga atttgtttct gaaagtgtta gtgcagagac 360 
tgtggaaaag tggctgaaga ggaaaaccaa caaagcaaaa gatgaaccat ctcccaagga 420 
agtcagcagg taccaggata cgaatatgca gggagtcgtg tacgagctga acagctacat 480 
agagcagcgc ctggacacgg gcggggacaa ccacctgctc ctctatgagc tcagcagcat 540
catcaggata gccacaaaag ccgacggatt tgcactgtac ttccttggag agtgcaataa 600 
tagcctgtgt gtgttcatac cacccgggat gaaggaaggc caaccccggc tcatccctgc 660 
agggcccatc acccagggta ccaccatctc tgcctacgtg gccaagtcta ggaagacgtt 720 
gttggtagag gatatccttg gggatgagcg atttcctcga ggtactggcc tggaatcagg 780 
aacccgcatc cagtctgttc tttgcttgcc cattgtcact gccattggag acttgattgg 840 
catccttgaa ctgtacaggc actggggcaa agaggccttc tgcctcagcc atcaggaggt 900 
tgcaacagcc aatcttgctt gggcttccgt agcaatacac caggtgcagg tgtgtagagg 960 
tctcgccaaa cagaccgaac tgaatgactt cctactcgac gtatcaaaga catactttga 1020 
taacatagtt gccatagact ctctacttga acacatcatg atatatgcaa aaaatctagt 1080 
gaacgccgac cgctgcgcgc tcttccaggt ggaccacaag aacaaggagc tgtactcgga 1140 
cctgtttgac attggggagg agaaggaggg gaagcccatc ttcaagaaga ccaaggagat 1200 
cagattttcc attgagaaag ggattgctgg tcaagtggca agaacaggcg aagtcttgaa 1260 
cattcccgat gcctacgcgg accctcgctt taacagggag gtggacctgt acacaggcta 1320 
caccacgagg aacattctgt gtatgcccat agtgagccga ggcagcgtga ttggcgtggt 1380 
gcagatggtg aacaagatca gcggtagcgc cttctccaag acagacgaga acaacttcaa 1440 
gatgtttgct gtcttctgcg cactggcctt gcactgtgct aacatgtacc acaggatccg 1500 
ccactcagaa tgcatctaca gggttaccat ggagaagctt tcctaccaca gcatctgcac 1560 
ctccgaggag tggcaaggcc tcatgcgctt caacctacca gcacgcatct gccgggacat 1620 
cgagctattc cactttgaca ttggtccttt cgagaacatg tggcctggga tctttgtcta 1680 
catgatccat cggtcttgtg ggacatcctg ttttgaactt gaaaaattgt gccgttttat 1740 
catgtctgtg aagaagaact atcggcgggt tccttaccac aactggaagc atgcagtcac 1800 
ggtggcacac tgcatgtatg ccatacttca aaacaacaat ggcctcttca cagacctcga 1860 
gcgcaaaggc ctgctaattg cgtgtctgtg ccatgacctg gaccacaggg gcttcagtaa 1920 
cagctacctg cagaagttcg accaccccct ggcggcgctg tactccacct ccaccatgga 1980 
gcaacaccac ttctcccaga cggtgtccat ccttcagctg gaagggcaca atatcttctc 2040 
caccctgagc tccagcgagt acgagcaggt gctggagatc atccgcaaag ccatcatcgc 2100 
caccgacctc gccctatact ttgggaacag gaagcagttg gaggagatgt accagacagg 2160 
gtcgctgaac ctccacaacc agtcccatcg agaccgtgtc atcggcttga tgatgactgc 2220 
ctgtgatctt tgctctgtga ccaaactatg gccagttaca aaattgacag cgaatgatat 2280 
atatgcagaa ttctgggctg agggtgatga gatgaagaag ctgggcatac agcccattcc 2340 
tatgatggac agagacaagc gagatgaagt ccctcaaggg cagctcggat tctacaatgc 2400 
tgtggccatt ccctgctata ccaccttgac gcagatcctc ccacccacag agcctctgct 2460 
gaaggcctgc agggataacc tcaatcagtg ggagaaggta attcgcgggg aagagacagc 2520 
aatgtggatt tcaggcccag gcccggcgcc tagcaagagc acacctgaga agctgaacgt 2580 
gaaggttgaa gactgatcct gaagtgacgt cctgatgtct gcccagcaac cgactcaacc 2640 
tgcttctgtg acttcgttct ttttgttttc aaggggtgaa aaccccctgt cagaaggtac 2700 
cgtcgcatat ccatgtgaag cagacgactc cctgcttgcc gcacacacct cggacagtga 2760 
gcaacccagg ctctgccgtg ttcagacgtc ggctactccg tggctccacc tgacctccga 2820 
atgctatttg ctcccaggcc agcactgcac tgtctggagg gggcagagac cacaggagag 2880
gttcttgcct gcatcctccc atgagggtgt ggccagttcc ctagttctgt gccatgctgc 2940 
tgcttggtgg cattggttag gaatgggaca cacgcccctt gttgtgaagt ttacatgtga 3000 
ccttcttata ggttaactga gtttgtggcc tgggacacat gtaatgaagg tcacagtcca 3060 
caggtgacag agaaatccaa actgttgatt acaggtgcac tacaggtatg ctctttcagt 3120 
ctatctgggg gcacataggt gagtctgctc cactcagaag gaagcatacc tctsccctca 3180 
tccaggggac acagggtaca tcccaggcat cggggaactg aagctctcac ttcaaaccat 3240 
gtcaaagaat taaaacacct cccctccccc tcactgtagc cttcggcaac tgcgccaatc 3300 
cctttataca aagaaaatat aagtaaggca tataaatttc ctccagcaag caaatcttgt 3360 
gggtaaaaaa aaaaaatgtg aattttaaca acctctatat tttcactgta tgttatggca 3420 
gaattttagt cacgtccaaa acaaaagatt attccagaag atacctcatc ctatgcctga 3480 
aagctccaca gcatggcgtc cgtctcccag ggttctgatc cgtctcctca cggtgcaatc 3540 
aggcaggaca ggaggaggtg cagggctacc acattgaccc agatggtatc tcctctcacc 3600 
attcagacat ccataaggaa tgccaaatgc tgtattgaat agttctcctg tgtgactttc 3660 
tagagaagcc aggacacccc tgagcctttc ctgggaactc ctaaggaagt cacaggttca 3720 
caccgtgggg attttcagga tagcatggag accagagaat cccggttcgg ttgttctcac 3780 
tcggtgagcc ttgagaagga agagactgac cagaaacact cactcagcac tctggcagga 3840 
gcaggagaag atactttaag atgaatcttt gggatagatt ttgatacacc caataccata 3900 
cacacaggag cttggcattt gcaaagtcta ttcagtttcc ttccacactc tgacccacgg 3960 
ttgtagcgga gtgggctgaa cactgtaaca ctgtacatgc gatttcccca tgggcttcta 4020
aaatgtcacc atctcctccc ctgctgtgtc ctactccatt tactggttac aaggtgatgt 4080 
caacaagaga agctatcaca acaccagggc tgtgcacacg tgcacacaca tgtatgcaca 4140
agcacacaga tgtatgtaca gcacacacac acacacacac cccaaaagga gagaaaagga 4200 
agaaaacatt tataaaaagc gacagctacc cccatattca aaaatagttc ttttccctgt 4260 
agggaaacag gtagctctcc ataaggaaat tatcatgagt gtgttctccc atcagtgcac 4320
ttctcccagg ggtgctcact gaagctggtc cacgtctata aacaggtgac actggctgca 4380 
gcaaaaagcc attcgatcca cacaaattga tcttctatca tcttggaatc tgaattgcag 4440 
ggaggagcag catgtaagac gaccgtttaa ttcaggcatt ccgaaggcat gagcgcatgg 4500 
attctgtcac caagcgtata aaaggaccct ggcattggga aacctatgac ggactgtttt 4560 
tgctgtagaa gtagggattt tacagaagtc tccttggatt tgccctgcct ggggcagttt 4620 
tgcagaggaa cctgccagag atttattggc tggtcagtct cttgtgaaat agtatcatgt 4680 
gagaaacagt ttgtagaaaa aaactatacc tgggaagacc tttgcaacat tgttccttcc 4740 
atgggccaag actcagttag gaggcataaa tctgcccgga ataaactagg ccaggataca 4800 
gccatgttta gttaataatt tggttttaga attcacacag gcaggattgg tttttttgtg 4860 
tcttggcaag tggagcatat ttaacataca ggcatgggaa tcctgcctct tagcttttcc 4920 
caccctcttg tctcaccaag ttttttctct ccaaaggttt ccaggaattt ctcattaatg 4980 
gctgatgcaa acttagtgaa taataatgaa tataaacaat gctcacctca ccaaaattat 5040
attatttgca gtcatttgtg ataacacaaa ttttatcgca atggttatta tttaatttgt 5100 
ggccacacac tgtggttatc ttttgttgtg gttgtttctg agaaaatgtt cttggatatg 5160 
taagtgccaa taccagtgtg aagtattgat cccgggcagc aaaatacagc ctaaggtttg 5220
taaacatcaa ttctatctca gttcatcaga gggcctgaga agctgcgggg cagtgtaaag 5280 
taaagtatgc tgggctggtg gtggtcagcc tccccttgcc aagaagagag caattgaatc 5340
ctgtccccag ctccctccac gcctgaagag tgaccagtgc tggcccgacg gatcgctgag 5400 
atattctccc ataatggcaa aaaaataggc agtttgatgt gacctgttta gtgtggctct 5460 
cctcttttga gcatgtgtta gcatttttat tttatactca tccagtgaac tctgctcttc 5520 
caagtgtgtt catgtatgtg ctagatatat tagcacagcc tgccttctgc tgcacaacgc 5580 
cttagagacc cggcctttca atgagcttag cttgtgctct gtttctgctc tcttaggtct 5640 
aaactatggt gtcagtttta atagaacaaa agtatgcatc ttgccttggc ttgagccttt 5700 
tcgttttcaa tgctgacttc tcccctttct ctcctgtgct caccttacct ttccagagtg 5760 
taagggacaa cttttaagga ggcgtgtccc tggtaggggc atccctgttc accaggtgcc 5820 
tgtcatcacc ccacttgact gacatctacc ctggtgacta tgggttcctc ttgtttgtag 5880 
ggaacggtgg ctccaggtgg aggcatcaat ctgttgggtt ctggttcccg gctgcctttg 5940 
gttttgaaag tctcttctct gtatattcct accctgcatt tgctttgtgt ggtgctgatg 6000 
ctgtggcagt aggatcttgg atgactctcc atcagtcaca gactccccct gttgcaaagt 6060 
gtcaggctga ctcgacagtc accgtaaaat ctgagtcagt cacacacagg ctgtcagcca 6120 
cggcttccac ttgcatggct attctatttt cacacgtgag tttctgttgc tggctggctg 6180 
actggcatta tctatgctaa gttgaaatca ggagtgtgcc cagcagagcc catcattctc 6240 
actgtctttg aaacaaagct gtacggtttg atcgatgaac gtatttaaag catttcatgc 6300 
aatgacaaag tgctcagtag tggaaggcag gctgtgacca gtctgcctgc tccttactat 6360 
aattgtgagg atttgttact ggaacagtac atggaggcct gaccttgtgg gggcacaggg 6420
tggaacctta gctgaatata gtgtgtgtct caagaggaag tcagggtact agctcagtgc 6480
tcaatctcca ggtactatat atacatttgc ccgttttatc tctaatgtga aataaatccc 6540 
caaacacttg tttatcgtgt agcgtaccta aaagactatt ctattatggg tgtccccact 6600 
ttcttggttt ggtcaccccg atcccccggt cttctgctgt atctagaaca gtgactataa 6660 
atgatgtatg ggaatagtgt ttccatatga tctgttgtct ggagtatatg ctacatgttc 6720 
atttactgta caaaaaccca gtgcagctga tgatgcaaag cagtctctct ctgtgtacag 6780 
tgccccacct atttaaaaat cacgtacaan cccagaacac tgtgaaacac ttaacataag 6840 
aaacaaacgc agcgtctgga ttctttccaa ggagagcagc tttctccaca ggaacacagt 6900 
aacaaaagag gtccgccgcc atccacaccc agccaagaca cctcagaggc catagggaca 6960 
acctccttgc tggccaacac ctgctggagc agggcacagg tcccagcaac tgatcctcag 7020 
tggatgggtc cgcagtcaaa gccttaatgg gctctctttt gaaggggaaa gaaanntttc 7080 
aagcttatga tatccaacat tattatagtt gatgagttag taaattccga aaaaaaaaga 7140 
tgattttata tgtatgacat aaaaaaaatc tttgtaaagt gcgcaagtgc aataatttaa 7200 
agaggtctta tctttgcatt tataaattat aaatattgta catgtgtgta atttttcatg 7260 
tattcatttg cagtctttgt atttaaaaaa actttactgt tatgtttgta taatagaaca 7320 
ttaatcattt attataactc agacaaggtg taaataaatt cataattcaa acagccagta 7380 
tatatgcata tatgggtgtt acattgcaaa aatctctatc tttgttctat tcacatgctt 7440 
aaagaagtaa gaaatctttt gtggatatgt aattatacat ataaagtata tatatatgta 7500 
tgatacatga aatatattta gaaatgttca taattttaat ggatattctt tggtgtgaat 7560 
aattgaatac aacattttta aaatgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 7618 